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SECTION  1 

RACK  C, ROE  ND  ANI)  INTRODUCTION 


RACKGROl  ND 

Hu-  performance  modeling  and  simulation  project  (572 O')  currently 
underway  at  M1TRK  has  as  its  objective  the  creation  of  a  C'?  simulation 
capability  during  the  Conceptual  Phase  of  system  acquisitions  on  the  part  of 
the  ESD  Sites.  The  project  is  thus  attempting  to  identify  appropriate  tools 
(models  or  modeling  systems!  that  may  be  employed  early  in  the  acquisition 
cycle.  Another  goal  of  this  project  is  the  specification  of  guidelines  for 
systems  simulation. 

With  these  goals  in  mind,  two  surveys  were  undertaken  at  the  outset  of 
this  project.  The  first  was  a  survey  of  the  current  uses  of  C*  simulation 
during  the  Conceptual  Phase,  along  with  an  analysis  of  selected  models.  The 
second  survey,  which  forms  the  content  of  this  paper,  focused  on  models 
that  may  be  employed  for  the  design  of  computer-communications  networks . 

Several  models  were  identified  which  might  be  employed  for  this  pur¬ 
pose  and  analyses  of  their  algorithmic  content  are  presented  as  the  heart  of 
the  report.  Summary  sections  are  also  included  presenting  the  author's 
views  on  the  utility  of  each  model.  An  overall  summary  is  presented  at  the 
end  of  the  report. 

It  is  important  to  point  out  that  to  our  knowledge,  none  of  these  models 
are  being  employed  on  a  current  ESD  problem.  (This  remark  also  applies  to 
the  models  identified  in  the  first  survey.  )  However,  plans  are  currently 
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tieing  made  for  the  application  of  one  or  more  model(s)  identified  in  these 
surveys  to  such  a  problem. 


One  of  the  more  promising  candidate  problems  is  the  design  of  a  tacti¬ 
cal  distributed  data-base  system  (Project  522G).  This  problem  involves  the 
design  of  Ixith  local  and  gloivtl  networks.  Hence,  it  is  felt  that  an  attempt 
should  lie  made  to  discover  any  special  models  that  are  particularly  relevant 
in  the  design  of  local  networks. 
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INTRODUCTION 


An  area  of  fundamental  importance  in  the  construction  of  C  systems  is 
most  certainly  the  ability  to  effectively  design  a  computer-communications 
network.  Indeed  these  networks  might  properly  be  recognized  as  forming  for 
all  practical  purposes,  the  central  nervous  system  of  our  military  command 
and  control  system.  The  world-wide  AUTODIN  II  network  and  the  SACDIN 
network  testify  to  the  reliance  of  the  military  upon  these  networks. 

3 

In  spite  of  a  rapidly  progressing  technology,  C  system  requirements 
place  a  heavy  load  upon  the  designer  of  such  a  system.  Among  the  require¬ 
ments  which  must  be  placed  upon  such  a  system  are : 

•  Excellent  Response  Time /Throughput  Characteristics  —  In  a 
time  of  crisis,  no  possibility  of  a  system  overload  can  be 
tolerated.  A  fast  response  time  is  mandatory. 

•  System  Survivability  —  The  system  must  be  made  as  resistant 
to  interference  as  possible. 

•  Expandability  —  As  technology  changes,  and  as  political  and 
military  considerations  vary,  the  ability  to  reconstruct  and 
update  the  system  is  an  important  consideration. 

•  Cost-effectiveness  —  In  spite  of  the  importance  of  the  above- 
listed  requirements,  the  ravages  of  inflation  simply  do  not 
permit  unlimited  funds  for  the  inclusion  of  all  desirable 
characteristics  in  any  military  system. 

Thus,  an  excellent  system  design  is  required,  coupled  with  an  ability 
to  maintain  the  system  efficiently  and  expand  it  as  necessary.  Given  the 
technical  complexity  of  the  design  questions,  one  is  led  to  search  for  auto¬ 
mated  design  tools  that  can  take  some,  if  not  most,  of  the  burden  off  the 
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designers  of  the  system.  A  number  of  such  tools  do  indeed  exist.  It  is  the 
purpose  of  this  paper  to  describe  and  evaluate  the  utility  of  these  tools  in  the 
context  of  the  design  of  a  (''*  system.  The  emphasis  is  upon  a  comparison  of 
the  algorithms  employed  In  the  various  design  phases. 

With  this  purpose  in  mind,  the  paper  is  organized  along  the  following 
lines.  Section  1  provides  a  general  background  on  computer-communications 
networks  —  their  classification,  the  technologies  employed  in  their  construc¬ 
tion,  and  the  fundamental  design  issues  and  difficulties  involved  in  addressim 
these  issues.  Section  2  then  presents  a  collection  of  algorithms  that  are 
commonly  employed  in  the  construction  of  these  networks.  Section  3  provides 
detailed  descriptions  of  tne  software  packages  presently  available.  Finally, 
Section  -4  is  a  summary  ,  in  general  terms,  of  the  results  of  the  analyses. 

Computer  Networks 

A  wide  variety  of  computer  networks  are  in  existence  today,  ranging 
from  intra-companv  networks  that  might  be  used  for  stock-control  and  dis¬ 
tribution,  to  packet-switched  networks  such  as  ARPANET  (ARPA  Network), 
whose  purpose  is  to  interconnect  a  collection  of  heterogeneous  computers  in 
the  1  .  S.  so  that  users  and  programs  at  one  center  will  have  access  to 
facilities  available  at  other  centers.  The  CV LADES  network  in  France  is 
another  example  of  a  packet-switched  network. 

On  a  functional  basis,  it  is  common  to  distinguish  between  three  types 
of  networks:  Remote-Access  Networks  (RAN),  Value-Added  Networks  (VAN), 
and  Mission-Oriented  Networks  (MON). 

The  Remote-Access  Networks  are  essentially  designed  to  support  com¬ 
munications  between  a  user  and  a  host  computer.  Commercial  time-sharing 
systems  such  as  TYMNET  (Tymshare,  Inc.)  and  INFO  NET  (Computer 
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Sciences  Corporation)  fall  into  this  category.  Another  well-known  example 
is  the  Sri'A  (Societe  Internationale  de  Telecommunications  Aeronautiques) 
network,  used  to  interconnect  computers  handling  airline  reservations  for  a 
group  of  some  1  (»0  carriers  around  the  world. 

The  Value-Added  Network  supports  communications  between  computers 
rather  than  between  users  and  host  computers.  The  well-known  example  of 
such  a  network  is  the  ARPANET,  which  was  developed  to  provide  enhanced 
computational  resources  to  researchers  at  a  variety  of  different  institutions. 
For  example,  the  MIT  Multics  time-sharing  system  is  available  through  the 
net,  as  is  the  ILLIAC  IV  at  NASA’s  Ames  Research  Center  and  an  IBM 
360/91  at  U.  C.  L.  A.  The  network,  as  a  VAN,  has  among  its  capabilities  the 
ability  to  support  long  file  transfers,  the  querying  of  remote  data-bases,  and 
multi-processing,  which  can  occur  at  geographically  distinct  sites.  A  logical 
map  of  the  ARPANET  appears  in  figure  1. 

In  implementing  a  VAN,  one  creates  a  subnetwork  of  computers  whose 
function  is  strictly  within  the  communications  realm  —  the  transfer  of  mes¬ 
sages  between  the  "host"  processors  (e.g.  ,  the  ILLIAC  of  ARPANET). 
Appropriately  enough,  this  net  is  referred  to  as  the  communications  sub-net 
and  the  computers  utilized  therein  are  referred  to  as  communications 
processors. 

In  the  ARPANET  the  communications  processors  are  referred  to  as 
IMPs  (Interface  Message  Processors).  In  the  event  that  the  IMPs  are  con¬ 
nected  only  to  terminals ,  and  not  to  host  computers ,  they  are  then  referred 
to  as  TIPs  (Terminal  Interface  Processors). 

A  geographical  map  of  the  ARPANET,  as  of  June  1975,  is  included  as 
figure  2. 
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Figure  2.  Geographic  Map  of  ARPANET,  -June  197 


The  highest  level  of  our  taxonomy  is  occupied  by  the  Mission-Oriented 
Network  (MONK  The  distinction  between  a  VAN  and  MON  is  strictly  organi¬ 
zational.  Rather  than  have  a  diverse  collection  of  users  attempting  to  access 
the  communications  subnet  for  their  own  purposes,  the  hosts  and  the  subnet 
operate  under  the  control  of  one  single  organization.  The  proposed  SAC’DIN 
network  is  an  example  of  such  a  network,  as  is  the  AUTOPIN  network. 

Figure  .'1  is  a  logical  map  of  Al’TODIN. 

There  arc  three  separate  point-to-point  transmission  technologies 
presently  employed  in  networks: 

1.  Circuit  (or  Line)  Switching 
Message  Switching 

.1.  Packet  Switching 

Figure  I  is  a  diagram  taken  from  KLKI7(>  that  compares  the  network  delay 
time  for  the  three  types  of  transmissions. 

In  circuit  switching,  the  entire  path  Ix'tween  the  sender  and  receiver 
(the  two  hosts,  in  our  case)  is  set  up  prior  to  message  transmission  by  a 
special  signal  that  makes  its  way  through  the  network,  seizing  available 
channels  as  it  goes.  In  the  event  that  a  successful  path  is  established  through 
the  network,  a  return  signal  is  sent  back  through  the  network  indicating  that 
transmission  may  begin.  The  entire  path  between  the  two  users  is  then  com¬ 
pletely  dedicated  to  this  transmission.  This  is  the  approach  employed  by  the 
telephone  network.  The  DAT  RAN  network  f  FLAT74  1  (lie  fore  it  passed  on), 
was  an  example  of  a  circuit  switched  data  network.  In  general,  t he  set-up 
time  for  a  dial-up  circuit  is  on  the  order  of  5-20  seconds.  Nevertheless,  the 
increasing  speeds  of  transmission  which  came  about  as  a  result  of  improved 


Figure  3.  Logical  Map  of  AUTODIN 


switches  made  the  DATRAN  network  an  operational  possibility.  The 
interested  reader  should  consult  F1.AT74. 

in  message  switching,  the  entire  message  is  first  sent  from  the 
"sender"  to  a  predetermined  node.  The  message  is  then  sent  along  the  next 
channel  of  its  journey  to  a  third  node,  and  so  forth,  until  it  eventually  reaches 
its  destination.  If  a  channel  is  busy,  t he  message  simply  waits  in  an  output 
buffer  until  the  appropriate  channel  is  idle,  or  it  may  l*>  routed  along  an 
alternate  channel.  The  SIT  A  and  AFTODIN  networks  are  both  message - 
switched  networks. 

Packet  switching  differs  from  message  switching  in  that  each  message 
is  broken  into  small  segments  (or  packets)  of  predetermined  length.  Then, 
the  packets  are  sent  independently  of  one  another  in  "store-and-fonvard” 
fashion  through  the  network.  (Doth  message  and  packet  switching  are  referred 
to  as  "store  and  forward"  switching.  ) 

Figure  1  illustrates  one  of  the  major  advantages  of  packet  switching  — 
namely  the  "pipelining"  effect.  The  essence  of  this  effect  is  that  if  the 
message  length  is  not  too  long,  the  message  delay  time  for  packet  switching 
will  in>  considerably  less  than  for  circuit  switching,  and  will  also  be  less  than 
lor  message  switching.  Indeed,  the  delay  time  for  message  switching  is 
found  to  tie  proportional  to  the  product  of  the  message  length  and  the  number 
ot  hops  required,  while  the  delay  time  for  packet  switching  is  proportional 
to  the  product  of  packet  length  and  the  number  of  hops,  plus  a  term  propor¬ 
tional  to  the  message  length. 

If  the  transmission  requirements  consist  basically  of  long  messages 
(e.g.  ,  file  transfers),  then  circuit  switching  would  be  the  preferred  approach. 
On  the  other  hand,  in  a  somewhat  more  mixed  ("bursty")  environment. 
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consisting  of  both  long  file-transfers  and  brief  interactive  messages, 
packet  switching  provides  a  very  effective  approach. 

A  number  of  discussions  of  packet  switching  have  appeared  in  the 
literature.  The  interested  reader  might  wish  to  consult  CHOW75  or  K l..E17(>. 

The  ARPANET  is  the  best-known  example  of  a  (distributed)  packet  - 
switched  network.  As  indicated  by  the  ARPA  geographical  map  (figure  2), 
the  network  links  approximated  100  computers  throughout  the  I'nited  States 
to  one  another  as  well  as  to  some  sites  in  Hawaii  and  Europe  (achieved  by 
satellite).  Each  host  computer  is  connected  to  an  HUP  via  a  100  KR  sec 
asynchronous  channel.  The  IMPs  are  then  connected  to  one  another  via 
synchronous  f>0  KR  sec  full  duplex  channels.  Each  host  computer  has  a 
network  control  program  resident  in  its  operating  system.  This  program 
allows  host  computers  to  communicate  with  one  another  via  a  host-host 
protocol.  Another  program,  TELNET,  serves  as  an  interface  between  the 
user  and  the  network  control  program. 

In  order  for  a  user,  located  at  a  terminal,  to  access  a  remote  host 
computer,  the  user's  byte  stream  is  first  broken  into  messages  (maximum 
size  approximately  8000  bits)  by  the  user's  host  computer  and  is  then 
delivered  to  its  associated  IMP  (along  with  the  message  destination).  The 
IMP  then  breaks  the  message  into  packets  (maximum  size  1008  bits),  deter¬ 
mines  the  route  for  each  packet,  provides  error  control  and  message  buffer¬ 
ing  "services,"  and  handles  acknowledgments  of  messages  from  host  to  sender. 

In  the  last  few  years,  interest  has  focused  on  the  possibilities  of  com¬ 
bining  circuit  switching  and  packet  switching  in  one  network.  The  circuit 
switching  would  be  employed  for  long  file  transfers,  while  the  packet 
switching  would  lx'  utilized  for  the  shorter,  interactive  messages.  Some 
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studies  have  attempted  to  determine  which  sort  of  messages  would  be  tx'st 
suited  for  each  id'  these  technologies  f  CFOS72  ]  ,  while  others  have  attempted 
to  analyze  the  performance  of  these  networks  f  KlSt'Tb]  . 


Further  complicating  the  issue  is  the  possibility  of  integrating  Udh 
voice  and  digital  networks,  as  well  as  the  possibility  of  digitizing  and  packet- 
i/ing  voice  traffic.  Finally,  using  satellites  as  transmission  facilities  lends 
yet  another  level  of  complication  to  the  choice  of  switching  technologies. 

Design  Considerations 

In  designing  a  network,  one  can  distinguish  between  two  general  prob¬ 
lem  areas  —  global  and  local.  The  primary  glotial  problems  to  bo  addressed 
are : 

•  Capacity  Allocation  —  How  much  capacity  (bit  sec)  does  one 
allocate  to  each  communications  line? 

•  Routing  Algorithms  —  What  is  the  most  efficient  wav  to  route 
messages  between  users  and  resources?  Should  the  routing 
bo  static  or  adaptive  (that  is,  adjust  to  changing  conditions)? 

Routing  is  especially  important  in  the  design  of  distributed 
networks,  a  point  that  will  be  emphasized  later. 

•  Network  Topology  —  Given  the  user  (terminal)  locations,  the 
resource  locations,  and  the  traffic  statistics,  how  can  one 
locate  concentrators  and  multiplexors,  and  then  connect  the 
resources  with  the  users  in  a  cost-effective  and  reliable 
manner'.’  Some  well-known  topologies  are  represented  in 
figure  :».  A  "typical"  configuration  which  characterizes  either 
a  local  access  or  centralized  topology  is  shown  in  figure  t>. 


T  erminals 


c.  Loop  configuration 


SI  HW77.  i'.  4.  Reprinted  hy  permission. 


Figure  5.  Network  Topologies 


•  Reliability  —  How  does  one  ensure  that  there  will  be  alternate 
communication  paths  in  the  event  that  some  collection  of  nodes 
and  links  is  not  available? 

Among  the  local  questions  to  be  answered  are: 

•  What  access  techniques  should  be  utilized  for  line  control 
(e.  g.  polling  or  contention)? 

•  What  buffer  sizes  and  loading  algorithms  should  be  employed 
when  choosing  the  appropriate  concentrators? 

•  Should  multiplexing  or  concentration  of  messages  be  employed0 

The  major  performance  criteria  used  to  evaluate  network  performance 
(which  must  therefore  provide  the  standards  against  which  the  above  con¬ 
siderations  are  to  be  evaluated)  are: 

•  Time  Delay  —  Is  the  delay  between  the  origination  and  receipt 
of  a  message  within  acceptable  limits? 

•  Throughput  —  Can  the  network  handle  the  "busy-hour"  load  as 
well  as  the  "normal"  traffic  load  without  the  introduction  of 
lost  messages  and  excessive  delay  times,  not  to  speak  of 
deadlocks  ? 

•  Reliability 

•  Network  Cost 

•  Sensitivity  —  What  is  the  sensitivity  of  the  particular  network 
to  variations  in  traffic  loads,  equipment  available,  etc.  ? 

That  is,  how  "robust"  a  design  has  been  produced  for  the 
network? 


A  formulation  of  the  design  problem  for  computer-communication 
networks  might  ix>  phrased  as  follows  [PICK76,  p.  54 )  : 

(’liven:  Location  of  data  sources  and  sinks,  and  the 

required  data  flow  I  between  them. 

Find:  Minimum  cost  network  to  meet  these  require¬ 

ments  by  choosing  the  topology ,  location  of 
multiplexing,  concentrating  and  switching 
devices,  capacities  of  the  links  and  the  routing 
strategies;  subject  to 

Constraints:  Reliability,  Delay,  Flow 

In  order  to  avoid  any  optimism  about  the  nature  of  this  problem,  some 
of  the  difficulties  inherent  in  its  solution  should  be  pointed  out.  These  include 

•  Number  of  Possible  Networks  —  It  can  be  shown  that  there  are 
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ways  of  arranging  M  links  among  N  nodes.  Obviously, 
a  brute  force  ’’look  at  everything"  approach  is  out  of 
the  question. 

•  Discrete  Elements  —  Capacities  for  lines  yand  other  elements) 
come  in  disci’ete  sizes.  For  example,  line  speeds  can  be 
2400,  3600,  50,000  bits  sec.  Since  integer  programming 
techniques  are  not  sufficiently  advanced  to  handle  problems 
of  ordinary  size,  approximate  methods  are  forced  on  us. 
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•  Nonlinearities  —  Time-delay  functions,  component-cost 
structures,  etc.  are  nonlinear.  Hence,  we  are  faced  with 
the  difficult  area  of  nonlinear  optimization.  The  nonlinear 
functions  are  neither  the  concave,  nor  convex  functions  for 
which  algorithms  exist. 

The  above  difficulties  are  in  reality  but  a  sample  of  the  true  difficulties 
associated  with  the  global  design  of  a  computer-communications  network. 

I  he  heart  ol  the  dilemma  lies  in  the  fact  that  all  the  problems  are  inter¬ 
related.  The  routing  affects  the  choice  of  capacities,  and  vice  versa.  The 
choice  of  topology  is  dependent  on  both  capacity  and  routing.  Another  area, 
reliability,  is  ill-understood  and  at  present  only  approachable  via  brute- 

force  techniques.  The  design  problem  is  greatly  compounded  in  the  case  of 
* 

large  networks.  Other  complicating  factors  include  a  tariff-structure  that 
constantly  changes,  the  incorporation  of  satellites  in  communications  networks, 
proposed  mixtures  of  voice  as  well  as  data  networks,  and  the  integration  of 
packet  and  circuit  switching  in  the  same  network. 

The  approach,  then,  is  to  attempt  to  (rationally)  separate  the  design 
considerations,  and  then  to  systematically  iterate  towards  a  solution.  All 
hope  of  an  optimal  solution  must  lx.'  abandoned,  and  replaced  with  a  willingness 
to  accept  an  heuristically  produced,  yet  feasible  design. 


The  paper  11SIE76  provides  an  illuminating  discussion  of  the  difficulties 
involved  in  the  design  of  a  large  network  as  well  as  algorithms  for 
switch  location. 
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DESIGN  OK  COM Pl'TER-GOMMl'NICAT IONS 
NETWORKS  -  ISSi  ES  AND  APPROACHES 


In  this  section,  we  outline  some  of  the  considerations  associated  with 
the  design  of  computer  networks,  and  present,  in  summary  form,  some  of 
the  ma  jor  algorithms  involved  in  the  design  of  centralized  and  distributed 
networks. 

A  discussion  of  several  simulation  approaches  to  reliability  is  presented 
in  section  3  as  part  of  the  discussion  of  Network  Analysis  Corporation's 
(NAG's)  models.  We  chose  to  present  it  in  section  II  because  NAG's  models 
represent  the  state  of  the  art  in  this  area.  A  detailed  survey  of  (he  subject 
may  be  found  in  FRAN70a. 

One  distinguishes,  in  general,  between  two  types  of  networks  —  cen¬ 
tralized  and  distributed.  The  fundamental  difference  lietwcen  these  two  types 
ot  networks  is  reflected  by  their  names.  In  a  centralized  network,  there  is 
one  computational  facility  and  all  the  users  arc  attempting  to  gain  access  to 
it,  while  a  distributed  network  has  a  numlter  of  (linked)  computational  facil¬ 
ities  that  a  number  of  users  are  attempting  to  access. 

Among  the  potential  topologies  for  a  distributed  network,  two  stand 

out  at  opposite  ends  of  the  continuum  —  the  ring  network  and  t he  completely 

connected  network.  These  topologies  are  illustrated  in  figure  7. 

♦ 

We  take  this  opportunity  to  remind  the  reader  that  we  are  really  talking 
about  the  organization  of  the  communications  subnetwork,  in  spite  of  our 
casual  use  of  the  word  "network.  " 


5V3«t 


Tin*  ring  network  minimizes  line-related  costs  l>nt  presents  some 
potential  reliabilitv  problems,  sinee  one  line  (or  node)  failure  threatens  to 
dtseonneet  the  entire  network  (unless  traffic  can  he  reversed  along  duplex 
transmission  lines).  FAR !!"'.!  provides  a  discussion  of  a  ring  network.  The 
completely  connected  network  possesses  no  such  line  failure  problems,  but 
docs  present  some  interesting  cost  considerations. 

With  (lie  advent  of  large  networks  (100  or  more  computer  facilities, 
and  associated  terminals),  the  notion  of  hierarchies  within  (lie  network  has 
sprung  up.  The  hierarchy  consists  of  a  high-level  backbone  of  "important" 
nodes  (implemented  as  a  packet-switching  network),  and  a  second  tier  of 
local-access  networks  (which  might  lie  implemented  as  centralized  networks). 
The  interested  reader  should  consult  (tKRI.7!»  for  a  discussion  of  the  issues 
and  problem  areas  of  large-scale  network  design. 

We  proceed  then  to  discuss  centralized  and  distributed  networks,  and 
present  some  salient  algorithms  associated  with  their  design.  Since  design 
algorithms  are  often  stated  in  graph-theoretical  terminology,  a  few  (very) 
elementary  definitions  precedi'  our  discussion. 

A  graph  is  essentially  a  collection  of  nodes  (or  vertices)  and  links  (or 
arcs).  Symbolically,  (1  (V,  D  where  V  is  the  set  of  vertices  and  F  is  the 

set  of  arcs.  In  our  situation,  the  nodes  represent  the  concentrators  or  com¬ 
puters,  while  the  arcs  represent  the  transmission  lines.  A  path  is  a  sequence 
of  arcs  on  the  graph.  (An  example  of  a  path  is  a  ,  a.(,  a„  in  figure  Sa.  )  The 
ring  and  completely  connected  networks  of  figure  7  provide  two  examples  of 
graphs.  The  ring  network  is  referred  to  as  a  cycle  in  graph-theoretic  ter¬ 
minology  I'he  completely  connected  network  is  called  a  completely  connected 
graph  on  six  vertices.  A  completely  connected  graph  on  n  vertices  has  each 
of  its  nodes  connected  to  all  the  remaining  nodes  in  the  graph. 


Graphs  are  often  distinguished  as  dl rooted  and  undirected.  por 
example,  the  graph  in  figure  8a  is  directed  (each  of  its  arcs  has  an  arrow 
on  tt\,  while  the  graph  of  figure  Sh  is  undirected.  Physically,  this  might 
correspond  to  simplex  transmission  lines  oriented  in  the  direction  indicated 
hy  the  arrows.  The  reader  will  note  that  lx>th  of  these  graphs  are  cycles. 

A  connect <'d  graph  is  one  in  which  there  is  a  path  between  every 
pair  of  vertices.  The  graph  of  figure  !»  is  disconnected.  The  notion  of  con¬ 
nectivity  in  a  graph  has  Important  Ix'aring  on  the  design  of  survivnble 
(nonvulnerable)  communications  networks.  4 


Figure  Graph  with  Two  Components 


A  tree  is  a  connected  graph  with  no  cycles :  it  can  be  shown  that 
this  is  equivalent  to  requiring  the  (connected)  graph  to  have  exactly  one  path 
between  any  two  nodes.  This  last  property  turns  out  to  be  a  very  important 
one  in  the  design  of  centralized  networks. 

DESIGN  OF  CENTRALIZED  NETWORKS 

In  general,  the  approach  to  designing  a  centralized  network  is  to 
attempt  to  define  a  tree  topology.  Conceptually,  this  is  a  very  appealing 
approach,  as  it  is  an  easy  matter  to  generate  the  minimal  spanning  tree  of 
a  graph  (i.  e.,  a  tree  that  will  connect  all  the  nodes  and  have  the  shortest 
total  arc  length).  Such  a  tree  corresponds  physically  to  connecting  all  the 
nodes  with  the  shortest-distance  communications  links.  We  will  discuss 
this  algorithm  later  in  this  section. 

It  should  be  made  clear  from  the  outset  that  this  is  not  necessarily 
the  ideal  or  "optimal"  topology,  by  any  means.  Frank  and  Chou,  in  their 
paper  "Topological  Optimization  of  Computer  Networks"  [  FRAN72  ]  give 
several  examples  of  tree  topologies  that  are  not  minimal.  The  example 
presented  in  figure  10  is  taken  from  this  paper.  As  implied  by  the  monthly 
line  costs  chart,  we  are  given  the  option  of  connecting  nodes  1  and  2  (the 
users)  to  the  central  node  (3)  via  either  19.  2  or  50  KB  lines.  It  turns  out 
to  be  more  economical  to  connect  the  nodes  in  the  fashion  indicated  in 
part  (c)  than  via  the  tree  topology  of  part  (b):  this  of  necessity  forces  us 
to  route  some  of  the  flow  between  nodes  1  and  3  via  node  2  (at  least  5.  8  KB/sec). 

Another  example  mentioned  in  the  same  paper  points  out  that  a 
minimal  spanning  tree  can  be  improved  via  the  addition  of  so-called  "Steiner 
Points"  [  FRAN 72 ,  pp.  1388-1389). 


Mil  t  S 


1000  Mills 


SO  Ml 


?00  2 
MILES 


OIS1  AMIES 
(a) 


!<0  Ml 


cost  S.u  .’oo 
(111 


(.■I 


1  HAT  f  |C  MAI  MIX  (Kill’s) 


NODES  1  i 


1  0 


0 

0 

40 


i 


3 

.’!> 

40 

0 


MON  1HI  V  I  INI  COSTS 


(SI 

COS  1 

CAI’ACIt  V 

DAI  A  St  1 

I’l  II  Mill 

m  2 

8!i0  0 

$7  50 

tiO  0 

850  0 

SI 5  00 

Figure  1(>.  I'ent  rail  zed  Netw  ork  Kxample 

In  spite  ot  these  caveats,  a  tree  topology  does  possess  certain  definite 
advantages.  Most  notable  among  them  are: 

•  II  link  costs  are  convex  functions  of  capacity,  then  some  tree 
solution  is  indeed  optimal  |  Y.U'.KT t  ]  .  As  pointed  out 
previously,  however,  link  costs  are  otten  discrete  functions. 

•  In  a  tree  topology,  there  is  a  unique  path  In’tween  the  user  and 
the  computer.  This  obviates  the  need  to  consider  routing 
algorithms  in  determining  the  network  topology .  and  makes 
life  much  simpler  for  the  network  designer. 


•  In  tho  event  that  (ho  link  charges  arc  high,  i(  is  sensible  to 


minimize  tho  nuinhor  ami  length  of  tho  edges.  In  snoh  a  ease, 
a  troo  topologv  should  lx'  considered  [HOTH71  ]  . 

•  It  can  l>o  shown  [KOTH711  that  given  am  "optimal"  tojvology, 
thoro  o\ists  a  troo  within  tho  not  work  snoh  that  all  unsaturatod 
links  aro  in  tho  troo.  llonoo,  tho  "optimal"  topology  doviatos 
from  a  troo  topology  to  tho  oxtont  that  saturated  links  are 
added  to  it . 

A  ivirthor  note  ot  optimism  inserts  itself  when  wo  not*'  that  oven  in  tho 
event  that  one  is  willing  to  rest  riot  one's  options  to  trees,  a  globallv  optimal 
solution  still  eannot  U'  fouml.  it  is.  however,  possible  to  assign  eapaoittes 
optimally  to  a  given  tree  tojvology ■.  \  design  procedure  for  centralized  t\et 

works  has  boon  devised  based  on  this  algorithm.  KHAN 7 lb  contains  atv 
interesting  discussion  of  this  approach. 

We  restrict  ourselves  in  this  discussion  to  a  network  design  technique 
which  consists  ol  two  fundamental  stages: 

1.  An  algorithm  that  locates  tho  position  of  concentrators  w  ithin 
a  network. 

2.  Several  algorithms  that  are  used  to  connect  tho  terminals  to 
the  concent  raters. 

As  promised,  the  design  procedure,  even  in  tho  "simple"  centralized  ease, 
is  hierarchical  in  nature. 

I'ho  algorithms  presented  will  tx'  referenced  in  tho  next  section,  when 
they  are  described  as  part  of  the  comparative  study  of  tho  software  packages 
examined  during  (ho  course  of  tho  survey. 


Location  of  Concentrators 


The  two  algorithms  described  here  are  referred  to  as  the  "add"  and 
"drop"  algorithms  in  the  literature.  The  add  algorithm  appears  to  lie  the 
most  commonly  used  algorithm  for  locating  the  position  of  concentrators 
within  a  network.  Our  description  will  follow  that  of  Schwartz  fSCHW'771  . 

The  statement  of  the  problem  is: 

Given :  ( 1 )  A  set  of  n  terminals  T . T  )  .  and  their  locations. 

I  1  n  I 

(2>  A  central  facility  S^. 

(3)  A  potential  set  of  concentrator  locations  !s  .....  S  ^  . 

i  1  ml 

Choose:  A  subset  of  the  concentrator  locations,  and  connect  the 

terminals  to  S  via  the  concentrators  chosen  or 

directly  to  S  . 

0 

Note  that  the  concentrators  are  to  lie  connected  to  S ^  via  a  high-speed 
line,  to  accommodate  the  increase  in  traffic,  while  the  terminals,  when 
directly  connected  to  S  ,  will  utilize  low-speed  lines. 

The  following  restrictions  apply  to  the  problem: 

•  Concentrator  S.  has  a  fixed  cost  f  ,  which  consists  of  both 

.1  .i 

the  hardware  and  t he  cost  of  connecting  it  to  the  central 

facility  S„.  (f  =01 
0  '  0 

•  Each  concentrator  has  a  maximum  "port"  capacity  e.  The 
port  capacity  refers  to  the  maximum  number  of  terminals 
that  the  given  concentrator  can  accept,  e  is  assumed  n. 
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Iii  practice,  of  course,  the  port  capacity  can  vary  depending 
upon  the  concentrator  chosen,  but  this  results  only  in 
further  extensions  of  the  algorithm. 


•  c.  is  the  cost  of  connecting  terminal  i  to  concentrator  j. 

This  results  in  an  m  \  n  cost  matrix  (array). 

I'o  formulate  the  problem  mathematically,  we  introduce  the  follow¬ 
ing  notation: 

1  if  T.  is  connected  to  S  . 

>  ) 

0  otherwise  . 


Concentrator  S.  is  said  to  be  open  if  it  is  in  use 
J 

not  in  use.  Defining 


and  closed  if  it  is 


v  = 
i 


x,.  >  0 
t.I 


i- 1 


0  otherwise 


we  note  that  v.  1  if  S  is  open  and  y  0  if  S  is  closed 
'  j  .1  i  i 

betting  7  lx1  the  cost  function,  we  find  the  following  representation 

for  the  problem: 

Minimize : 


m  n  m 

z  Ev.  ££ 

j  o  i  i  j=n 


X, .  c  . 

t.I  1.1 
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The  first  term  is  the  cost  of  the  concentrator,  while  the  second  represents 
the  cost  of  the  terminal-to-eoncentrator  links. 

'/  is  to  be  minimized  subject  to  the  following  two  constraints: 


m 


j  0 

bach  terminal  i  must  l>o  connected  to  some  concentrator. 

(This  includes  the  possibilin  of  the  central  facility  S. ) 

n 

2‘SXij  “  e>  j  =  1>  •••  ’  m  • 

i-1 

No  more  than  e  terminals  may  be  connected  to  concentrator  j. 

When  the  problem  is  posed  in  the  above  manner,  we  have  on  our  hands 
a  0-1  integer  programming  problem.  While  it  is  possible  to  employ  branch- 
and-bound  techniques  in  solving  it,  these  are  very  time-consuming,  and  the 
solution  represents,  in  fact,  only  a  portion  of  the  overall  network -planning 
problem.  Hence,  the  algorithm  to  bo  described  turns  out  to  lx?  heuristic 
in  nature. 

We  chose  to  describe  the  algorithm  using  an  example  taken  from 

SCHW77.  Six  terminals,  T  ...  ,  T  are  to  be  connected  to  three  concen- 

I  (> 

trators,  S^,  SiJt  S^,  or  to  the  central  node,  S^,  as  shown  in  figure  11. 


ADI)  Algorithm 


We  initialize  the  algorithm  by  connecting  all  of  the  terminals  to 


S 

0 


Hence  all  of  the  concentrators  are  closed.  The  total  cost  is  therefore 


i 


7  E',0  " 

i  1 


(refer  to  the  cost  matrix).  This  is  illustrated  by  figure  12. 


f  igure  12.  Initialization  of  Add  Algorithm 

We  proceed  by  opening  up  the  concentrators  one  at  a  time,  and  search¬ 
ing  for  the  maximum  decrease  in  cost. 

Iteration  1.  We  open  up  each  concentrator,  one  at  a  time,  to  determine 


that  location  (for  the  first  concentrator)  which  will  produce  the  greatest 
decrease  in  cost. 


Thus,  opening  S  ,  we  connect  the  terminals  that  produce  the  greatest 
cost  savings  (c  -  c^l  up  to  a  maximum  of  e  =  3  terminals.  The  terminals 
chosen  for  connection  to  S ^  are  T^,  TtJ,  and  T^.  The  remaining  terminals 
are  connected  to  S  . 

6  3 

m-nco,  Zl'2CiO*'l  *2C|l'n 

i=4  1=1 

Next,  we  open  up  S  ,  and  discover  that  terminals  3  and  6  should  be 
connected  to  S  .  In  this  case  Z0  -  13. 

Finally,  we  open  up  S  ,  and  discover  that  terminals  3,  5,  and  6  should 
be  connected  to  it.  In  this  case  =  10. 

Since  7.  represents  the  smallest  cost,  we  connect  the  terminals  as 
shown  in  figure  13. 


Figure  13.  Add  Algorithm,  Iteration  1 


Iteration  2.  We  now  leave  S  open,  and  check  the  other  two  concentrator 
locations,  S(  and  S,)t  successively  to  see  if  any  improvement  in  cost  results. 
Note  that  we  are  going  to  check  all  of  the  terminals,  including  those  already 
connected  to  S.{. 

By  connecting  T  ,  T„,  and  T,{  to  we  find  a  decrease  of  three  cost 
units  and  an  increase  of  two,  resulting  in  a  net  decrease  of  one  cost  unit  from 
this  arrangement.  Hence  /  =  9  at  this  stage. 

Repeating  the  procedure  for  S0,  we  produce  no  improvement  in  our  cost 
structure.  Therefore  we  terminate  the  algorithm  with  7.  =  9  and  with  the 
arrangement  of  terminals  and  concentrators  shown  in  figure  14. 


<  !  H.im-J  mu  M  H\N  **-  199 


Figure  14.  Add  Algorithm,  Iteration  2 

It  is  the  author's  observation  that  the  "add"  algorithm  is  really  nothing 
more  than  the  Vogel  solution  to  the  allocation  problem.  Another  algorithm 
in  use  for  this  problem  is  the  so-called  "drop"  algorithm. 
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I  In*  (i! op  al  go  1  i t h in  starts  with  .'ill  of  the  concent rntors  left  open t  unci 
then  systematically  closes  one  at  a  time,  determining  the  cost  improvement 
at  each  iteration,  until  no  further  improvement  results. 

Initialization.  The  first  step  is  to  examine  (lie  cost  matrix  previously 
developed  for  terminal-concent rator  connections  and  to  find  (lie  cheapest 
possible  concentrator  to  which  to  connect  each  terminal.  The  result  is 
depicted  in  the  following  matrix,  where  a  I  in  the  concentrator  column  sig¬ 
nifies  llu‘  decision  to  connect  a  given  terminal  to  that  particular  concentrator. 
This  process  takes  into  account  t he  concentrator's  port  capacity. 


S 

i 

o  i  2  ;t 


This  choice  results  in  t he  configuration  shown  in  figure  1!>,  with  a 
cost  of  /,  II, 


The  choice  of  concentrator  2  for  terminal  l  is  arbitrary,  as  we  could 
have  chosen  concentrator  0. 


INITIALIZATION.  Z  11 

ii.im  iI  mi  si  nu  71,  |i.  :ou. 


Figure  15.  Drop  Algorithm,  Initialization 

Iteration  1.  We  now  close  each  concentrator  one  at  a  time,  transferring 
the  terminals  attached  to  these  concentrators  to  other  locations,  and  examine 
the  impact  on  the  cost. 

Starting  with  S^,  we  discover  that  transferring  T ^  to  either  S  or  S„ 
results  in  a  cost  increase  of  1.  The  same  remark  applies  to  T„.  By  trans¬ 
ferring  to  either  S0  or  S^  we  obtain,  once  again,  a  cost  increase  of  1. 

Since  f,  =  2,  we  obtain  a  net  increase  in  cost  of  1  (5  -  fo).  Hence  we  do  not 
close  S j . 

Next  we  close  S0,  transfer  to  S  ,  and  produce  a  net  improve¬ 
ment  of  2. 

Closing  S,?  results  in  an  (unacceptable)  increase  in  cost  of  1. 

Hence,  we  choose  to  close  S,, ,  and  transfer  T  to  S  ,  with  a  resultant 
cost  of  Z  !).  The  resultant  configuration  is  the  same  as  that  produced  by 
the  add  algorithm's  final  pass  (figure  15). 
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lie  nil  ii'n 


I  siii);  (In'  oonfigurat ion  olitainod  from  tho  sooond 


ttoration,  wo  now  dotormino  if  rlosiii);  otthor  S,(  or  results  in  a  oost 
improvomont.  No  improvomont  turns  out  to  tv  possilvlo;  thoroforo  wo  ond 
with  (ho  sumo  notwork  oonfigurat ion  produood  In  tho  add  algorithm. 


I’onnoottng  l'orminals  to  Iho  I’onoonlrator 

In  dosoribing  llioso  algorithms,  wo  onoo  again  follow  tho  oxposttion 
givon  hy  Sohwart,'  |St'IIW77|  . 


I'hroi'  algorithms  aro  list'd  lor  tho  "lino-la\ out"  prohlom. 

1.  I'ho  I'sau  Williams  Algorithm  forms  tho  ooro  of  tho  appmaoh 
most  oommonh  in  uso  todav  It  is  tho  algorithm  omployod  Iw 
tho  1HM  notwork  dosign  program,  i'NIH’  vi'ommuniontions 
Notwork  Ht'sign  Program). 

kruskal's  Vigor  it  Inn  is  hasoti  upon  tho  minimum  spanning  troo 
algorithm,  whioh  produoos  tin'  ohoapost  notwork  dosign. 

H  I'ho  I  ni  t  i«'d  Hosi^ii  Algorithm  omUulios  tho  I'sau  Williams 

algorithm  as  ti  spooial  oaso,  ns  woll  as  othov  dosign  algorithms 
that  aro  ourrontlv  implomonlod.  Phis  algorithm  gonoratos 
lH'ttor  solutions  than  tho  I'sau  Williams  algorithm  as  a  rosult 
of  tho  taut  that  it  inoludos  olhor  algorithms  as  spooial  oasos 
of  it.  Honor  a  numbor  of  solutions  oan  ho  simultanoously 
gonoratod.  and  a  si  lootion  of  tho  Ih'sI  ono  oan  1h'  mado. 

Kiguro  It),  from  Si'll\V77,  illustratos  tlio  algorithm’s  o|vrnti«m. 

I'ho  tour  torminals  illustraloil  in  figuro  Id  aro  to  In'  oonnootod  to  tho 
indioati  d  oomputor  at  nodo  I 


II 


CENIRAl  FACILITY 


II.IM  .I  I>I1  M  ll\P7,  |\  201. 


Figure  in.  Example  for  Line-Layout  Problem 
1  lie  traffie  generated  at  eaeli  link  is  given  bv: 

a2  ~  2’  au  a'a4=2*a5^ 

1  he  oost  of  establishing  a  link  lietween  any  pair  of  nodes  is  given  by 
the  following  oost  matrix 

NODE 


NODE  :i 
4 


We  denote  the  cost  of  connecting  the  two  links  i  and  j  by  c  ,  and 
allow  the  maximum  flow  on  any  link  to  be  5  units.  This  maximum  flow  could 
correspond  to  the  actual  capacity  of  the  lines  themselves,  or  to  a  maximum 
allowed  flow  on  the  lines  that  is  determined  from  a  maximum  time-delay 
requirement. 


Fsau-U'illiams  Algorithm  [  FSAt'tWi  ] 

This  algorithm  operates  by  searching  out  the  nodes  most  costly  to 
connect  to  the  computer  and  replacing  each  one  by  a  connection  to  a  neighbor- 
ing  node  (which  is  then  connected  to  the  computer).  Thus, 


might  bo  replaced  by 


Step  0 


Initialize  by  calculating  all  trade-off  parameters,  t 


if 


where 


t.. 

i.i 


c, , 

i.l 


-  c 


il 


for  all  i,  j.  Thus  t,.  measures  the  difference  in  cost  between 

ij 

connecting  node  i  to  the  computer  via  node  j,  and  connecting 

node  i  directly  to  the  computer.  From  our  example, 

t„  j  -  c„  (  -  c.,  j  =  1 .  Clearly,  we  are  in  search  of  negative  t . . ' s . 

Step  1  Select  the  minimum  t..,  and  consider  connecting  i  to  i.  In 

our  example,  the  expression  t  =  -5  is  the  minimum. 

f>3 

Step  2  Check  to  see  that  the  flow  constraints  are  satisfied,  (is  the 
line  overloaded?) 


IF  go  to  step 

Fl.SF  set  t  or ,  and  go  to  step  1. 
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4  *IMIM  ■ 


For  our  example,  a.  I  if  it  is  connected,  which  is  clearly 
less  than  the  maximum  of  5.  Hence,  we  continue  to  step  3. 

Step  3  I. ink  i-j  is  added  to  our  network.  Reevaluate  the  constraints 
and  update  the  trade-off  functions.  Go  to  step  l. 

In  our  example,  flow  a  Ix'comcs  a.  a.  *  a  J  since 

•'  3  ;>  3  ’ 

node  •>  is  connected  to  node  3.  Functions  t  and  t  are 
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deleted  from  our  list. 

It  would  Ih’  worthwhile  for  the  reader  to  continue  with  our  example. 
The  final  result  is  depicted  in  figure  17. 


THE  Nl  IMBE  US  ON  THE  t  INKS 
.j  INDICATE  THE  OHDER  IN  WHICH 
THE  CONNECT  IONS  WE  HE  M  ADI 


I  -  |  M.isi-.l  on  Si  n.  |>.  i  s: 

Figure  17.  Fsau-Williams  Algorithm,  Final  Result 
Kruskal's  Algorithm 

As  was  mentioned  earlier,  one  of  the  most  conceptually  appealing 
algorithms  involved  in  the  design  of  centralized  networks  is  the  minimum 
spanning  tree  algorithm.  Kruskal's  algorithm,  which  is  based  on  the  m.  s.  t. 
algorithm,  is  included  here  for  the  sake  of  illustration,  despite  the  fact  that 
it  is  not  used  in  any  of  the  commercially  available  design  packages. 


A  ■ 


Tito  statement  of  this  algorithm  is  quite  simple.  Carry  out  the 
following  step  until  no  longer  possible :  Among  t ho  links  not  yet  selected, 
choose  the  least-cost  link,  cheeking  to  make  sure  that  it  does  not  form 
a  circuit  with  the  links  already  selected,  and  that  flow  constraints  are  not 
violated.  II  one  does  not  check  for  the  flow  constraints,  one  produces  the 
minimum  spanning  tree. 

The  m.  s.  t.  algorithm  provides  a  lower  bound  on  the  cost  of  the  net¬ 
work.  All  of  tlu’  algorithms  discussed  in  this  section  will  reduce  to  this 
design  if  all  constraints  are  removed. 

Returning  to  our  example,  we  note  that  the  minimum  entry  in  the  cost 
matrix  is  3.  Links  '2-1,  3-1,  and  1-3  all  have  this  same  value.  Hence  we 
choose  randomly  among  these  links,  picking  link  2-1.  The  flow  constraint 
is  satisfied  and  the  connection  is  made.  Repeating  this  procedure  for  links 
3-1  and  1-3,  we  find  that  these  connections  can  also  be  made.  However,  no 
further  connections  may  be  made  through  node  3,  as  the  flow  in  link  3-1  has 
now  reached  its  maximum  value  of  a. 

Continuing  in  this  fashion,  we  arrive  at  the  network  shown  in  figure  IS. 


THE  NUMBERS  IN  PARENTHESES 
INDICATE  THE  ORDER  IN  WHICH 
THE  CONNECTIONS  ARE  MADE 

II:, VO. I  Oil  S,  HW  7  7,  p.  I  8.V 


Figure  IS.  Kruskal  Algorithm,  Final  Result 


I  nificd  Heuristic  Algorithm  (due  to  Kershenbaum  and  Chou 


Kershenbaum  and  Chou  show  that  a  large  number,  if  not  all,  of  the 
algorithms  used  for  the  line-layout  problem  may  l>c  unified  under  their 
banner  [KKKS7  1]  .  As  mentioned  before,  this  produces  a  significant  advan¬ 
tage  for  their  algorithm,  as  no  single  line-layout  algorithm  is  really  perfect 
for  all  possible  cases.  Hence  the  ability  to  generate  parallel  designs  is  an 
important  one. 

The  algorithm  assigns  a  weight,  w  ,  to  each  terminal,  and  then  com¬ 
putes  a  trade-off  function  t  as  follows: 

tj 

t . .  =  c , ,  -  w . 
i.l  l.l  i 

where  c  is,  as  before,  the  cost  of  connecting  terminal  i  to  concentrator  i. 
By  defining  w  differently,  one  obtains  each  of  the  algorithms  already  dis¬ 
cussed  as  well  as  a  host  of  others.  The  reader  is  referred  to  KERS74  for 
a  more  complete  discussion  of  this  property.  The  weights  appropriate  to 
each  of  the  algorithms  discussed  in  our  paper  are  listed  below  (from  KERS74, 
p.  17(16): 

I'pdatc  When  (i  -  j) 


Algorithm  Initialization  Is  Brought  In 

Esau-Williams  w  =  c, . ,  i  =  1 ,  .  . .  ,  N  w,-*-w 

til  i  j 

Kruskal  w.  =  0,  i  -  1,  .  .  .  ,  N  None 


The  algorithm  proper  is  as  follows: 

Step  0  Initialize  the  w.  for  i  =  1,  .  . .  ,  N. 

Initialize  the  constraints. 

Set  t,.-*-e.,  -  w,  for  all  i,  i  when  no  constraints  are  violated 
i.l  ij  i 

by  making  this  connection. 


Step  1  Compute  t  .  =  min  t  for  all  i,  i,  i  4  i. 
t ,  .1  i.) 

IF  t.  .  ■■■  op  terminate  the  algorithm. 
i,.l 

ELSE  go  to  step  2. 


If  a  =  0,  one  obtains  the  Kruskal  algorithm,  while  if  a  =  b  =  1,  the  Esau- 
Williams  algorithm  is  obtained.  Differing  values  of  a  and  b  produce  a  family 
of  solutions.  It  is  useful  to  note  that  the  expression  in  parentheses  is  essen¬ 
tially  a  straight  line  joining  c.^  and  c  . 

The  interested  reader  should  consult  SC1IW77,  chapters  9  and  10,  for 
a  more  complete  coverage  of  design  algorithms  for  centralized  networks,  as 
well  as  MART72,  chapters  40  through  13,  for  the  "IBM  approach." 


DESIGN  OF  DISTRIBUTED  NETWORKS 

As  indicated  earlier,  the  principal  design  difficulty  arising  in  distrib¬ 
uted  networks  is  the  interplay  of  routing  considerations  with  the  remaining 
problems  (capacity  allocation,  etc. )  In  light  of  this  difficulty,  we  present 
a  brief  discussion  of  routing  in  a  distributed  network,  and  follow  this  with 
a  discussion  of  a  global  design  algorithm,  known  as  the  "cut-saturation" 
algorithm. 
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Routing  in  a  Distributed  Network 


It  is  important  to  lie  aware  of  the  faet  that  we  are  eoncerned  with 
specifying  a  fixed  routing  policy  ,  that  is,  one  that  does  not  vary  with  time 
according  to  changing  traffic  conditions.  The  objective  is  to  provide  a  mini¬ 
mum  average  time-delay  design,  assuming  stationary  traffic  conditions. 

This  approach  is  clearly  consistent  with  our  situation,  as  we  are 
concerned  with  the  design  of  the  network;  and,  because  of  the  interplay  of 
routing  with  other  design  considerations,  wo  must  have  some  idea  as  to  the 
appropriate  distribution  of  flows  in  the  network.  For  informative  discussions 
on  routing,  the  reader  may  wish  to  consult  SCHW77,  chapter  it  or  K\.F.l7t'>, 
chapter  5. 

We  wish  to  present  an  algorithm  for  routing  which  was  employed  in  the 
design  of  the  AllPANF.T ,  the  Flow  Deviation  algorithm.  First,  however,  we 
need  an  expression  for  the  average  time-delay  encountered  by  a  packet  in 
a  distributed  network.  Such  an  expression  has  been  developed  by  Kleinrock 
[KI.F.17t>j  .  In  developing  the  time-delay  formula,  he  assumes  Poisson 
arrivals  and  exponential  message  lengths. 

In  assuming  exponential  message  lengths,  Kleinrock  invokes  the 
so-called  independence  assumpt ion  —  essentially  ignoring  the  interdepen¬ 
dence  between  arrival  rates  and  service  times  at  a  given  node  in  the  network. 
This  assumption  is  tenable  due  to  the  external  arrival  of  messages  at  a  given 
node,  and  has  I  icon  verified  experimentally.  See  Kl.F.UU  and  Kl.FI7(>. 

Assuming  NA  channels  (ares)  and  N  nodes  in  our  network,  Kleinroek's 
formula  is 


4S 


i  flow  (bits  'sec)  in  the  i—  channel 


■)  =  total  traffic  in  network 


—  =  average  length  of  a  data  packet 


—  -  average  length  of  all  packets  (including  acknowledgments) 

t* 

C.  =  capacity  (bits /sec)  of  the  i— channel 
P.  =  propagation  delay  on  i—  channel 


K  =  nodal  processing  time 


\ 


fi'  C. 

The  first  expression  in  the  brackets  — — - —  is  an  expression  for  the 

(i'C.  -  X. 

th  1  1  1 

waiting  time  at  the  i — channel,  while - —  is  an  expression  for  the  service 

(i.e.  transmission  time)  on  this  channel.  One  also  includes  P.,  the  propa¬ 
gation  time  to  transmit  one  bit  down  the  length  of  the  channel.  This  result, 
based  on  Jackson's  theorem  [  JACK57  and  JACK 63  1  ,  is  essentially  a  sum¬ 
mation  of  delay  times  due  to  a  series  of  M/M/1  queues. 


We  now  are  in  a  position  to  present  a  summary  version  of  the  Flow 
Deviation  algorithm,  which  was  employed  in  the  design  of  the  ARPANET. 

Step  1  Suppose  NA  =  number  of  arcs,  and  f”  =  (f  ,  f  )  the  vector 

1  NA 

representing  the  set  of  link  flows  at  the  n—  iteration  of  the  algorithm. 
Compute : 

1.  =  3T/3f.,  i  =  1 . NA  . 


Step  2  Find  the  shortest- route  flow  vector  V,  utilizing  the  above  1  as 

i 

the  link  "lengths.  " 


Step  3  l  et 


,n  l 


( 1  -  A  If 


o  \ 


i 


and  employ  a  search  method  te.g.  Filionaeoi)  ti>  locate  a  \ 
which  minimizes  1  \  more  detailed  discussion  of  this 

algorithm  ma\  U>  found  in  Kl.FITti,  chapter 

Another  algorithm  for  routing,  the  Extremal  Flow s  algorithm  due  to 
Cantor  and  Gerla  (  FAN  FT  I  ]  ,  is  presented  in  section  3  of  this  paper  in  the 
discussion  id  Network  Analysis  Corporation's  model  for  distributed 
network  design. 

chit -Saturation  Algorithm 

We  present  in  this  section  an  outline  of  the  "cut-saturation  algorithm," 
which  is  one  of  three  "known"  design  techniques  employed  in  distributed- 
network  design,  t  he  algorithm  was  developed  b\  Network  Analysis  Corpo¬ 
ration,  and  will  be  discussed  in  somewhat  greater  detail  later  in  this  paper. 
The  other  two  algorithms  are  the  Hranch-Xehiuigo  algorithm,  and  the  Concave 
Branch  Elimination  Algorithm,  kleinroek  and  Gerla  [GERL77]  attempt 
a  comparison  of  tin  performance  of  these  algorithms,  and  reach  some  inter¬ 
esting  conclusions.  One  of  the  most  notable  is  the  underlining  of  t he  diffi¬ 
culty  in  even  making  these  comparisons,  l  he  authors  point  out  the  need  for 
work  in  this  area. 

The  cut -saturation  (C-sl  algorithm  aims  at  producing  the  least-cost 
distributed  network  for  a  given  throughput,  subject  to  time  delay  and  reli¬ 
ability  constraints.  1’he  algorithm  as  presented  below  assumes  that  link 
capacities  are  given,  and  that  they  are  all  equal.  However,  it  is  claimed 
that  a  straightforward  extension  of  the  algorithm  that  includes  the 


multiple-capacity  case  is  easily  implemented.  For  a  fuller  discussion  of 
the  algorithm,  see  GERL74. 

The  algorithm  consists  of  the  following  five  steps  (for  each  iteration): 
Step  1  Routing 

Starting  with  a  given  network  design,  link  flows  are  found  that 
minimize  the  overall  time  delay. 

Step  2  Saturated  Cut-Set  Determination* 

After  determining  the  flows  in  the  network,  the  links  are  ordered 
according  to  their  use.  The  minimal  set  that  will  disconnect  the 
network  —  the  saturated  cut-set  —  is  then  removed.  An  example 
of  a  saturated  cut-set  is  shown  in  figure  19. 

Step  3  Add-Only  Step 

The  purpose  of  this  step  is  to  add  the  least-cost  nodes  to  the 
network  —  which  will  divert  traffic  from  the  saturated  cut-set. 
Since  one  would  like  to  divert  traffic  as  far  away  from  the 
saturated  cut-set  as  possible,  and  since  the  cost  of  communication 
links  increases  with  distance,  one  must  effect  a  compromise  of 
sorts.  Such  a  compromise  is  achieved  by  using  a  "distance  2" 
criteria  —  nodes  that  are  a  minimum  of  two  links  removed  from 
the  cut-set  are  chosen  as  potential  candidates  for  removal.  This 
is  illustrated  in  figure  19. 


Gerla,  et  al.  [GERL74]  claim  that  this  cut-set  approximates  the  famous 
minimum  cut-set  of  the  Ford- Fulkerson  theorem. 


Saturated 
cut  set 


•  *  "Distance  2"  nodes  for  add  only  step 

I  rum  N  \i  '7J,  p.  4-7,  Reprinted  by  permission. 

Figure  19.  Saturated  Cut  in  the  ARPANET 
Step  4  Delete-Only  Step 

The  purpose  of  this  step  is  to  remove  links  that  have  the  least 
"marginal  utility"  from  the  network,  one  per  iteration. 
Mathematically,  the  links  are  chosen  according  to  the  following 
criteria. 

c.  -  f. 

Maximize  E  =  D  •  — - — 

i  i  c. 

l 

where 

D.  is  the  cost  of  the  link 
c.  is  the  capacity  of  the  link 
f.  is  the  flow  in  the  link 
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Stop  5 


I  Vrturbat ion  Stop 


This  stop  reduces  llu*  not  work  oost  once  tho  desired  throughput 
range  has  boon  achieved.  t'sing  arid-only  and  rioloto-onlv 
operations,  tho  notwork  links  aro  "reorganized , "  while 
maintaining  tho  throughput  within  i  f>  poroont  of  tho  desirod  goal. 

Comparisons  aro  also  made  to  previously  ohtainod  solutions  in 
order  to  determine  if  a  hot  tor  throughput  at  a  lowor  oost  has 
already  boon  ohtainod.  figure  20  contains  a  flow-chart  of  this 
slop,  t  he  ipiostion  "Is  tho  notwork  dominated""  in  tho  flow¬ 
chart  refers  to  those  comparisons. 

Stop  0  Chain-Collapsing 

Certain  chains  (in  "series")  a  re  replaced  by  a  single  link  to 
improve  the  algorithm's  efficiency. 

Kloinroek  and  Corla  [  l  77  |  note  that  tho  out-saturation  method  is 
an  extension  ol  the  branch  Xehango  method  —  which  essentially  starts  with 
a  given  topology,  and  obtains  local  minima  by  means  of  local  transformations. 
Those  transformations,  called  branch  Xehangos,  eliminate  existing  links 
and  replace  them  with  new  ones.  The  algorithm  is  described  (;•  l-'HANTOb. 

The  cut-saturation  algorithm  represents  an  advance  over  the  branch  Xehango 
method  in  that  it  employs  information  from  tho  problem  domain  while  molli¬ 
fying  tho  notwork,  whereas  the  branch  Xehango  method  does  not. 

The  concave  branch  elimination  method  is  particularly  useful  whenever 
the  discrete  costs  can  lx'  reasonably  approximated  by  concave  costs.  The 
method  relies  on  a  How  deviation  algorithm,  making  use  of  concave  costs, 
and  a  capacity  assignment  algorithm.  1'he  flow  deviation  algorithm  tends  to 
eliminate  uneconomical  links  from  the  topology.  Kloinroek  and  florin  point 


* 


I 


H<\Uf  in  V.VtOUS  lUMmcK’i  v 
topoU>g\  numht'i  of  itrMtiOns  tlt’Mi  nt 
1  ,,  %  throughput  hounds  H  M 

I 


Mout'uq  .ilgonthiY) 
dtMt'i  n > i mi's  throughput  H 

I 

I  i  i  <  1 1 


*»»  N  \*  |v  }  |  v  K.  I'lininl  h  |'(’ii«t«ion 


I'inuro  .’0.  Mow  rhiU'l  >'l  Saturntoil  rut  Sot  Aljforltluu 


i»\it  ih.it  tin'  choice  ol  llu*  partteular  algorithm  depends  upon  the  eost  nnd 
eapae it \  structure,  ns  well  as  upon  tlu'  degree  of  precision  required.  1'hov 
also  prosont  a  comparison  of  thoso  methods  for  a  particular  design  problem, 
and  ivaoh  tlio  oonolusion  mentioned  U'fore  —  that  it  is  a  difficult  task  to  ovon 
attempt  a  reasonable  comparison  among  the  existing  design  algorithms  for 
dist ributed  networks.  They  suggest  that  research  in  tin'  analysis  of  algorithm 
complcxitv  should  lv  pursued  to  these  ends. 

this  is  proha  hi  >  a  fitting  note  upon  which  to  end  our  discussion  of 
design  algorithms  pind  comparisons  lietween  them),  atnl  launch  into  a  dis¬ 
cussion  ot  commcrcialh  available  design  packages.* 


In  the  event  that  the  reader  is  not  sufficiently  intimidated  by  Kleinrock  and 
(lerla's  entotner.  we  remind  him  of  some  of  the  unapproached  problems 
discussed  earlier  in  (tils  section. 


six'  tion  :i 

COMP AH1SON  op  MOUKI  S 

A  number  of  organizations  were  contacted  in  I  ho  course  of  this 
survey  concerning  llu'  programs  (hoy  have  written  for  designing  and 
analyzing  computer  networks  The  organizations  worn  identified  as  a 
result  o|  a  literature  search  in  which  throe  sources  proved  to  ho  tho  most 
t i'ii i t tii  1  I'hese  sources  wore 

•  \  memorandum  written  by  Mr.  Hruce  I'cldmever  of  M1THK. 

Ih'l  M  ).!  l!>.  which  identified  industrial  organizations  that 
have  produced  soilwaro  packages  intended  to  aid  in  tho 
design  of  communications  processors  (front  ends,  con¬ 
centrators,  etc .  h 

•  "  I  bird  Mutual  Survex  ol  Performance  belated  Software 
Packages,"  I  HP  Portormanee  lioviow,  IVcottthor  P.'Tf). 

•  Articles  in  tho  technical  literature  describing  algorithms 
employed  in  the  various  facets  of  computer  network  design. 

The  organizations  were  contacted  hy  telephone  or  letter.  In  all 
cast's  literature  was  requested  concerning  their  software,  with  a  particular 
emphasis  on  technical  papers.  I  he  respondents  in  general  were 
cooperative,  although  one  companx  preferred  to  discuss  its  philosophy 
instead  of  its  product. 

I'wo  meetings  were  held  with  companies  in  the  course  of  this  project 
One  meeting  was  with  Scientific  l  ime  Sharing  (Sl'St  at  MITKK's  Bedford 
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offices.  STS  demonstrated  their  product,  ODIN,  and  discussed  its  capabili¬ 
ties.  Another  meeting  was  held  at  the  Randolph,  Mass.  ,  offices  of 
St  ientitic  Applications  Inc.  (SA1),  at  which  the  company's  philosophy  was 
discussed. 

A  list  of  t he  organizations  discussed  in  this  report  appears  in 
table  1.  As  is  to  bo  expected,  not  all  organizations  contacted  in  the 
course  of  this  project  appear  in  the  table  -  for  a  variety  of  reasons. 

A  prime  example  is  HIM,  which  possesses  a  tool  for  network  design, 
but  is  interested  in  using  it  for  the  design  of  networks  built  by  IBM, 
and  not  by  the  competition. 

I  he  focus  of  the  report  is  on  a  comparison  of  the  algorithms 
employed  in  the  various  tacets  ol  network  design.  No  hands-on  evaluation 
ol  the  various  design  tools  has  boon  attempted  —  only  an  analysis  of  the 
algorithms  employed.  In  keeping  with  this  stated  approach,  the 
descriptions  of  the  packages  arc  organized  along  the  following  lines: 

•  General  —  1’he  types  of  networks  designed  (centralized  or 
distributed)  are  mentioned.  Specifics  of  implementation 
(computers,  languages)  arc  also  described. 

•  Functional  Organization  of  the  Programs  —  The  functions 
performed  by  each  of  the  program  modules,  as  well  as 
their  interrelationships,  are  discussed. 

•  Algorithmic  Structure  —  We  often  make  use  of  the  discussion 
in  section  2,  and  simply  refer  to  the  appropriate 
description. 

•  Summary  —  Brief  concluding  remarks  that  attempt  to  por¬ 
tray  the  software  constitute  the  focus  of  this  section. 


Table  1 

List  of  Organizations  and  their  Programs 


Organization  Program  Name 

1.  Kranzley  &  Co.  ,  Telecommunications  Division  PLANET 

1010  South  Kings  Highway 

Cherry  Hill,  New  Jersey  08034 

2.  Scientific  Time  Sharing  ODIN 

7316  Wisconsin  Avenue 

Bethesda,  Maryland  20014 

3.  Systems  Architects,  Inc.  SADC,  SADM, 

Thomas  Patton  Drive  SAND,  TALK 

Randolph.  Massachusetts  02368 

4.  Professor  G.  M.  Schneider  VANS 

Department  of  Computer  Science 

University  of  Minnesota 

114  Lund  Hall 

207  Church  Street,  S.  E. 

Minneapolis.  Minnesota  55455 


5.  The  DMVV  Group.  Inc.  NDMS 

2975  Hickory  Lane  AND  MS 

Ann  Arbor,  Michigan  48104 

6.  Network  Analysis  Corporation  GRINDER 

130  Steamboat  Road  MIND 

Great  Neck,  New  York  11024 
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One  exception  has  been  made  to  this  format:  that  is  the  VANS  model 
created  at  the  University  of  Minnesota.  This  exception  was  made  as  a 
result  of  the  special  nature  of  the  propram. 

On  the  whole,  two  distinct  approaches  appeared  in  these  design  tools. 
One  approach  was  modeled  after  IBM's  Communications  Network  Design 
Program  (UNDP).  In  spite  of  the  secrecy  that  shrouds  the  official  docu¬ 
mentation  of  this  program,  James  Martin  has  written  extensively  about 
the  algorithms  employed  therein  in  his  well-known  book,  Systems  Analysis 
lPJ*_ P&tft  Transmission  [MART72],  It  became  quite  clear  in  the  course 
of  this  project  that  all  of  the  organizations  contacted,  with  two  exceptions, 
follow  the  IBM  lead  in  structuring  their  design  tools.  The  two  exceptions 
were  the  Network  Analysis  Corporation  (NAC)  models,  and  the  VANS 
model.  It  should  be  noted  that  CNDP  is  oriented  towards  the  design  of 
centralized  networks,  and  accordingly,  the  only  company  that  designs 
truly  distributed  networks  is  NAC. 

Having  concluded  our  preamble  with  these  words,  we  now  commence 
our  discussion  of  the  individual  models. 


KRANZ LEY  &  COMPANY  —  PLANET 
General 

1  LANE  r  (Programs  lor  Line  Analysis  and  Network  Engineering 
I  asks)  is  essentially  a  collection  of  programs,  in  the  CNDP  mold,  which 
address  different  parts  of  the  network  design  issue.  These  packages,  to 
1)0  discussed  later,  may  tx>  put  together  in  a  variety  of  ways,  depending 
on  the  user's  specific  application^ ). 

The  package,  as  a  whole,  is  oriented  towards  circuit-switched  and 
message-switched  networks.  For  networks  of  the  latter  category,  it  aids 
m  the  design  of  centralized  networks.  It  is  written  in  FORTRAN  and  is 
available  in  local  batch  as  well  as  time-sharing  environments.  Support,  in 
the  torm  ot  post-installation"  consulting,  is  included  as  part  of  contractual 
arrangements. 

Functional  Orga niza t ion 

The  basic  modules  which  comprise  the  PLANET  system  are  described 

below. 

MODEL  is  a  discrete  event-stepped  simulation  of  a  communications 
line.  This  program  produces  performance  statistics  (response  time, 
throughput)  for  different  line  configurations  and  transaction  nrofiles  (mes¬ 
sage  lengths,  arrival  rates,  etc.  ). 

NETS YN  provides  a  least-cost  configuration  of  a  (private)  network, 
such  that  this  configuration  will  meet  the  response-time  and  throughput 
characteristics  required  by  the  user.  Hence,  for  inputs  of  terminal.  CPU, 
and  modem/concentrator  locations,  NETSYN  produces  a  low-cost  multi¬ 
point  line  layout  that  will  meet  the  user's  response  time  and  throughput 
requirements. 
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IjOCON  is  a  program  that  positions  multiplexors  and  concentrators 
in  the  network. 

WATSYN  (WATS  synthesizer)  is  a  set  of  programs  for  use  in  the 
selection  of  the  appropriate  WATS  service.  It  develops  the  sets  of  bands 
to  be  equipped  with  lines  as  well  as  the  full/measured  split  for  each  band  so 
equipped  for  inputs  on  load  distribution,  performance  criteria  (e.  g.  , 
desired  grade  of  seiwice)  and  costs. 

TOM. PRO  is  a  series  of  programs,  designed  to  gather  statistics 
about  long  distance  calls,  that  can  be  used  for  cost  allocation  purposes  as 
well  as  design  purposes.  1'he  break-up  of  calls  per  WATS  band  can  be 
used  as  input  to  WATSYN. 

SYSMOPEI.  is  used  to  model  computer  system  performance.  For  a 
given  configuration  it  develops  response  time  and  throughput  characteristics, 
and  also  pinpoints  bottlenecks  in  the  system. 

The  first  three  modules  can  be  arranged  as  shown  in  figure  21. 
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Figure  21.  Modules  in  PLANET 


The  last  modulo  discussed.  SYSMODEL,  can  bo  used  in  conjunction 
with  MODEL  to  produce  a  more  accurate  traffic  table,  as  it  will  include 
the  computer  system's  performance  in  creating  the  traffic  table. 

Algo rithmie  Structu re 

MODEL  is,  as  mentioned  before,  a  discrete  event-stepped  simulation 
of  a  communications  line. 

The  inputs  to  the  simulation  include  the  following: 

•  Terminal  and  control  unit  configurations 

•  Transaction  profiles  for  each  application 
Different  types  of  inquiries  and  responses  (formatted, 
unformatted,  etc.  ) 

Message  lengths 

Mean  arrival  rates  per  transaction 

•  Equipment  characteristics 

•  Response- time  objectives 
Output  includes 

•  Utilization  of  communications  lines 

•  Queue  lengths 

•  Response  times 

•  Percentage  of  messages  delayed  at  keyboards 

MODEL  is  therefore  utilized  to  create  a  traffic  table  of  maximum 
line  loadings,  to  be  employed  as  input  to  LOCON. 


LOCON  employs  an  add  algorithm,  as  discussed  in  the  preceding 
section,  to  position  the  concentrators. 

NETSYN  essentially  employs  Kranzley  &  Co.  's  version  of  the 
Esau- Williams  algorithm  for  least-cost  line-layout  as  also  discussed  in 
the  preceding  section. 

Summary 

As  indicated  earlier,  PLANET  is  essentially  written  along  the  lines 
of  IBM's  CNDP.  Its  major  drawbacks  are  a  lack  of  design  tools  for 
distributed  networks  and  a  lack  of  reliability  models  (in  the  military  sense). 
PLANET  appears  (on  paper)  to  be  a  solid,  well-maintained  product,  useful 
in  the  design  of  centralized  networks. 
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SCIENTIFIC  TIME  SHARINC..  INC.  -  ODIN 


Of  no  ml 

STS  markets  a  network  tie  si  pi  model  called  ODIN.  (The  company 
seems  to  have  an  interest  in  mythology.  )  The  model  is  interactive, 
is  written  in  API.,  and  is  supported  by  a  local  (Boston!  area  office.  It 
should  be  noted  that  SI'S  specializes  in  time-sharing  services  directed 
towards  API.. 

I  he  model  is  oriented  towards  the  design  of  a  centralized  circuit- 
switched  network,  and  can  be  utilized  in  the  design  of  message- switched 
networks.  Essentially.  ODIN  consists  of  a  series  of  modules,  which  can 
be  used  in  a  variety  ot  ways  for  the  design  of  computer  networks.  The 
company  provides  a  well-written  user's  guide  to  the  system,  as  well  as 
support  services  as  part  of  the  contractual  arrangements. 

Functional  i Organization 

Figure  22.  which  appears  in  the  user's  guide  to  ODIN  [FLLITS], 
presents  the  modules  and  their  interrelationships. 

The  ODIN  module  serves  as  a  system  executive:  it  initializes  para¬ 
meters  and  determines  the  options  to  lie  utilized  in  each  run.  It  accepts 
input  in  the  form  of  performance  goals  (response  time,  throughput),  data 
requirements  (tariff  environments,  message  lengths,  traffic  loadings, 
etc.  !  and  facilities  (terminals,  types  of  lines,  modems,  etc.  1  and  creates 
a  data  set  with  them.  It  also  constructs  a  distance  matrix  of  the  cities  in 
the  network. 
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Figure  22 


Flow  Chart  for  ODIN  Network  Design  Program 


I'he  QTAC  (Queueing  Theory)  module  uses  standard  queueing 
equations  to  determine  a  line-loading  table  (maximum  permitted  trans¬ 
mission  rates  terminal).  QTAC  can  lx>  bypassed  if  the  user  so  wishes 
(in  the  event  he  has  his  own  line-loading  table). 

NEXUS  determines  locations  for  concentrators,  given  a  list  of 
possible  locations. 

ADONIS  creates  a  least-cost  network  design,  employing  the  loading 
table  created  by  either  QTAC  or  the  user. 

ARIES  provides  a  line-by-line  "expense  report"  of  the  network. 
Algori  t  hmie  St  rue  t  ure 

We  omit  a  discussion  of  the  ODIN  module,  as  it  is  a  preprocessing 
module,  and  of  the  ARIES  module  which  functions  as  a  post-processing 
module. 

NEXUS  locates  concentrators,  utilizing  a  candidate  list  supplied  by 
the  user.  It  uses  an  add  algorithm,  as  discussed  in  section  2. 

QTAC  is  a  queueing  module  that  is  used  to  obtain  line- loadings. 

It  employs  the  Khintchint  -  Polloczek  equations  to  determine  the  response 
time  and  throughputs  obtainable  under  differing  line  disciplines. 

The  Khintchine- Polloczek  equations  provide  an  expression  for  the 
mean  number  of  items  awaiting  service.  E(w). 


K(w) 


where 

w  number  of  items  awaiting  service. 

a  facility  utilization.  In  a  steady-state  model  p  F.(n)  Fqt  ) 
where  K(n)  is  the  mean  arrival  rate  and  F(t  ,)  is  the  mean 
service  rate. 

Q'l'AC  uses  this  equation  in  building  models  of  line  disciplines, 
and  then  solves  it  to  build  a  line-loading  table.  'lie  line  disciplines  are 
essentially  combinations  of  the  following  six  parameters 

Fine  t Operation:  Full-duplex  or  llalf-duplex 

Host  Control  Released  or  Held 

Fine  Control:  Polled  or  Contention 

Figure  2d  is  a  diagram  illustrating  the  six  models  included  in  QTAC. 

I  bis  figure  is  taken  directly  from  the  ODIN  user's  manual. 

Two  polling  algorithms  may  be  employed: 

1.  FOX  (Full-duplex)  Algorithm  —  Here  outbound  messages 
from  the  host  may  be  transmitted  only  while  an  inbound 
message  is  being  transmitted  or  after  all  of  the  terminals 
have  been  polled  once.  Polls  have  priority  over  messages 
on  the  outbound  line. 
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Figure  23.  Line  Disciplines  Accommodated  bv  ODIN' 


2.  HPX  (half-duplex)  Algorithm  —  Outbound  messages  have 
a  higher  priority  than  inbound  messages.  They  also 
have  a  priority  over  polls. 

l'igure  24  (from  the  user’s  manual)  summarizes  the  response-time 
modules. 

ADONIS  is  STC's  implementation  of  the  Esau- Williams  algorithm 
for  configuring  least-cost  centralized  networks.  This  algorithm  was 
discussed  in  section  2. 

Summary 

STS' 8  model  is  written  along  the  lines  of  CNDP  (again).  Its  novel 
features  are  that  it  is  written  in  APE.  and  that  it  employs  queueing 
models  rather  than  simulation  models  for  developing  line-loading  tables. 
On  the  whole,  ODIN  appears  to  be  a  solid,  well-supported  tool  for  the 
design  of  centralized  networks. 

Its  major  drawbacks  appear  to  be 

1.  back  of  provisions  for  the  design  of  distributed  networks. 

2.  Use  of  queueing  equations  that  can  become  inaccurate 
when  evaluating  polling  (versus  contention)  models 
for  line  organization. 


:t.  back  of  reliability  models. 


Figure  24.  Response  Time  for  Different  Line  Disciplines 


SYSTEMS  ARCHITECTS,  INC.  -  SAPC,  S ADM ,  SAND,  TALK,  SAPA 
General 

SAI  markets  a  series  of  models  that  may  be  used  for  planning  and 
reviewing  a  data  communications  network.  The  major  design  programs 
are  interactive  in  nature,  while  other  supporting  programs  are  accessed 
in  batch  mode. 

The  programs  are  written  in  EORTRAN,  and  are  available  through  a 
time-sharing  service,  or  may  be  installed  at  the  user  sites. 

Unfortunately.  SAI  declined  to  give  any  information  as  to  the  nature 
of  the  algorithms  employed  in  their  packages.  A  substantive  discussion 
of  SAI's  products  is  impossible  under  the  circumstances,  and  we  can 
only  discuss  them  from  a  functional  point  of  view,  omitting  the  algorithm 
description.  Some  summary  remarks  are  still  in  order,  and  these  will 
be  pi’esented. 

_Eunc tional  Organisation 

SAI  has  five  programs  applicable  to  the  network  design  problem: 

SADC  Systematic  Approach  to  Data  Collection 

SADM  Systematic  Approach  to  Data  Management 

SAND  Systematic  Approach  to  Network  Design 

TALK  A  network  design  package  (full  name  not  available) 

SAPA  Systematic  Approach  to  Performance  Analysis 
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SADC  is  a  data  collection  module  (batch  mode),  whose  function  is  to 
create  a  data-base  from  user-supplied  information  as  to  traffic  flow, 
planned  user  facilities,  equipment,  etc.  This  information  may  then  lie 
used  as  input  to  other  modules. 

SADM  is,  in  the  words  oi  SAI,  "a  general  purpose  data  management 
program,  designed  to  handle  small  to  medium  data  base  applications" 
[PANG78].  Its  use  in  the  network  design  context  is  in  the  development  of 
input  data  for  the  two  major  design  modules,  TALK  and  SANDS,  as  well 
as  SAPA.  Inputs  to  this  module  may  come  from  SADC,  or  be  user-supplied. 

SANDS  is  an  interactive  program  whose  major  purpose  is  to  study 
various  design  reconfigurations.  1'he  user  may  start  with  either  the 
existing  network  or  with  an  initial  design,  and  then  examine  the  costs 
and  benefits  (response  time,  throughput)  for  various  design  changes. 
Analytical  (ostensibly  queueing)  models  arc  used  here. 

SAPA  is  a  set  of  programs  that  performs  the  response  time/throughput 
analysis.  Analytical  models  arc  used  for  single-host  networks,  while 
"simplified  simulation  models"  (in  the  words  of  SAI)  are  used  for  networks 
provided  by  the  various  vendors.  The  output  of  this  module  also  includes 
maximum  permissible  line- loadings  as  well  as  queue  sizes. 

TALK  is  used  either  to  evaluate  an  existing  system  or  to  plan  a  new 
one.  The  review  function  (performed  via  simulation)  examines  the  effects 
of  changing  grades  of  service  on  the  performance  level.  An  auditing 
function  keeps  track  of  existing  communications  facility  inventory,  and 
verifies  the  common  carrier  billings. 


talk  provides  a  network  designer  with  the  following  design  capabilities 


1.  The  User  Profile  Development  Model  characterizes 

the  individual  users,  workload,  and  traffic  requirements. 

2.  The  Remote  Locations  Configurator  determines  equipment 
configurations  and  whether  a  location  should  be  "on"  or 
"off"  the  backbone  network. 

11.  The  Regional  Network  Configurator  partitions  the  net¬ 
work  into  areas  and  designs  a  local  network  within 
each  area  using  concentrators  and/or  multiplexers. 

4.  The  Integrated  Network  Configurator  designs  the 
backbone  network  covering  all  on-net  locations 
and  WATS,  TELEX,  or  a  combination  of  other  ser¬ 
vices  to  cover  all  off-backbone  stations. 

5.  The  Front-End  Configurator  configures  the  front-end 
system. 

In  addition  to  these  five  capabilities,  SAI  maintains  a  collection  of 
data  bases  which  contain  information  on  telecommunications  products 
(modems,  front-ends,  etc.),  common  carrier  tariffs  (e.  g.  ,  AT&T, 
Western  Union),  and  value-added  tariffs  (e.  g.  .  TELENET.  TYMSH  ARE). 

Summary 

As  indicated  before,  the  company  declined  to  provide  specific 
information  on  the  algorithms  employed  in  their  packages.  The  only 
material  provided  was  promotional  in  nature.  Consequently,  no  in-depth 


analysis  of  SAI's  products  can  be  attempted.  However,  some  comments 
may  still  be  made: 


1.  The  ability  to  design  distributed  networks  appears  to  be 
lacking. 

2.  There  are  no  reliability  models. 

:t.  Partitioning  a  network  into  separate  areas  within  which 
centralized  networks  could  be  designed  is  a  difficult 
problem,  especially  in  the  case  of  large  networks.  One 
would  especially  wish  to  examine  the  algorithms 
employed  here. 
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UNIVERSITY  OF  MINNESOTA  -  VANS  (Value-Added  Network  Simulator) 

Professor  M.  Schneider,  Computer  Science  Department 

Professor  Schneider  is  presently  completing  the  implementation  of  the 
VANS  model.  Mis  major  purpose  in  building  it  is  to  produce  a  model  capable 
of  investigating  the  utilization  of  different  network  protocols. 

Protocols,  in  the  context  of  a  computer  network,  are  simply  rules 
which  govern  the  exchange  of  information  between  processes  —  establishing 
and  maintaining  connections,  transferring  messages  after  making  connections, 
etc.  They  must  cover  the  following  areas: 

•  Opening,  maintenance  and  closing  of  connections 

•  Flow  control 

•  Process  initiation 

•  Process  interruption 

•  Development  and  maintenance 

•  Recovery  from  failure 

A  discussion  of  network  protocols  (for  the  interested  reader)  may  be 
found  in  DAVI73,  chapter  13. 

Schneider,  in  "A  New  Methodology  for  Computer  Network  Simulation" 
[SCHN78]  ,  makes  the  point  that  present  simulations  of  computer  networks 
are  what  he  terms  parametric  simulations.  By  this,  he  means  that  the  user 
is  required  to  input  a  list  of  parameters  —  line  loadings,  error  rate,  etc.  — 
but  a  detailed  study  of  different  protocols  and  their  effects  on  the  network  is 
not  possible.  In  contrast  to  this  "genre"  of  simulation,  he  defines 


a  structural  simulation  as  being  characterized  by  the  three  following 
points  [SCHN78]  : 

(a)  All  subnetwork  activity  would  be  factored  into  a  disjoint, 
nonoverlapping  set  of  responsibilities  called  protocol  areas. 

tbl  A  user  would  initially  "build"  the  structure  of  a  sub¬ 
network  by  providing  the  name  of  the  process  which  will 
support  each  of  the  above  protocol  areas.  These  pro¬ 
cesses,  which  can  come  either  from  the  user  or  a 
standard  library,  will  be  collected,  compiled  and  linked 
together  to  form  a  simulation  model  of  a  specific  network. 

(O  A  user  would  parametrize  the  network  just  constructed  by 
providing  those  data  values  consistent  with  and  needed  by 
the  model. 

Such  a  simulation  provides  ttie  user  with  the  abilitv  to  "parametrize," 
as  before,  and  also  to  modify  the  protocol  structure  of  the  network.  As  a 
result,  the  user  could,  for  example,  investigate  protocols  in  the  areas  of 
dynamic  routing,  error  detection  and  correction,  flow  control  techniques, 
line  allocation  policies,  etc.  Schneider  has  built  such  a  model,  and  a  brief 
description  of  its  organization,  together  with  an  interesting  list  of  experi¬ 
ments  one  may  perform  with  it,  is  included  below.  Our  discussion  of  his 
model  follows  his  own  description  in  SC11N78,  and  the  figures  which  accom¬ 
pany  this  discussion  appear  in  that  paper. 

Description  of  VANS  Model 

VANS  is  an  interactive  model,  consisting  of  four  phases,  as  depicted 
in  figure  25.  As  is  indicated  in  the  diagram,  the  user  has  the  option  of 
reentering  any  of  the  preceding  stages  upon  completing  one  run  of  the  model. 


For  a  more  complete  discussion  of  Professor  Schneider’s  model,  the  reader 
is  urged  to  consult  SCHN78  and  SC1IN7(>. 
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The  preprocessing  phase  of  the  model  is  broken  into  two  parts, 
structural  and  parametric  (which  by  now  should  be  suggestive  names),  noth 
of  these  phases  are  written  in  PASCAL  and  are  interactive. 

The  structural  preprocessor  essentially  allows  the  user  to  build 
a  library  of  protocols  to  support  some  32  protocol  areas.  Kaeh  of  these 
32  areas  has  at  least  one  module  corresponding  to  "standard"  protocols  in 
that  area.  A  default  module  has  also  been  designated  in  all  of  these  areas, 
obviating  the  need  for  an  explicit  choice  in  each  area  —  the  user  has  only  t o 
pick  those  that  are  of  interest  to  him.  lie  mat  also  suppl\  a  protocol  module 
written  by  himself,  providing  it  conforms  to  certain  standards. 

The  parametric  preprocessor  is  also,  as  mentioned  before,  w  ritten  in 
PASCAl. ,  and  is  interactive  in  nature.  The  user  inputs  values  for  quantities 
such  as  the  number  of  nodes,  line  speed,  priority  levels,  etc.  Again  default 
values  exist,  and  the  user  is  only  required  to  provide  values  for  parameters 
which  he  selects. 

The  simulation  phase  of  VANS  is  organized  as  a  three-level  process 
tree .  as  shown  in  figure  2C>. 

The  first  two  levels  are  not  accessible  by  the  user,  while  the  third 
level  consists  of  the  protocol  modules,  which  arc  under  the  user's  control. 

The  executive  level  has  responsibility  for  generating  the  level  2  node 
processes,  as  well  as  initializing  all  tables,  constants,  and  queues.  The  six 
level  2  processes  (five  are  shown  in  the  diagram)  are  implemented  by 
S1MI  LA  processes.  Following  Schneider's  description  [SCHN78]  these 
processes  are: 

•  HOST  —  The  HOST  processes  model  the  message  generation 
consumption  operations  of  each  HOST  at  the  node.  There  is  a 
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Figure  26.  Process  Tree  Organization  of  VANS 
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Figure  27.  Diagram  of  a  Network  Node 
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As  shown  in  the 


separate  process  for  each  MOST  type, 
example  in  figure  27 ,  there  would  be  n  separate  HOST 
processes  for  that  node. 

•  Bl- FFER-MANAGER  —  This  process  models  the  operation 
of  the  HOST-CI  Direct  Memory  Access  (DMA)  channels  and 
the  management  of  the  Cl's  primary  buffers.  There  is  a 
single  BUFFER-MANAGER  process  for  each  Cl. 

•  COMMUNICATIONS  INTERFACE  (CD  —  The  Cl  process 
models  all  processing  capability  located  within  the  communica¬ 
tions  interface.  Even  though  the  Cl  processor  may  be  viewed 
as  being  composed  of  a  number  of  identical,  independent  sub¬ 
processors,  there  is  only  a  single  Cl  process  per  network  node. 

•  LINE  INTERFACE  (LI)  —  Each  LI  process  models  the  operation 
of  a  communication  line  interface  with  DMA  capabilities. 

Referring  to  the  example  in  figure  2(5,  there  would  be 

m  distinct  LI  processes  for  this  node. 

•  STORE  —  The  STORE  process  models  the  backing  store  and 

the  DMA  channel  between  the  primary  and  secondary  Cl  buffers. 

•  GREMLIN  —  GREMLIN  is  a  randomly  occurring  process 
causing  unplanned  errors,  failures,  and  outages,  and  in 
general  wreaking  havoc  among  all  network  components . 

All  of  these  processes  are  accomplished  via  calls  to  the  level  3  pro¬ 
cesses  whose  order  of  activation  and  choice  is  determined  by  the  level  2 
processes.  All  of  the  level  3  routines  are  implemented  as  SIMULA  procedure 


In  an  attempt  to  assure  that  a  change  in  a  level  3  procedure  doesn't 
cause  an  unexpected  change  in  another  procedure,  each  protocol  module  must 
conform  to  a  certain  set  of  rules  or  capability  descriptors .  These  miles 
address  the  following  areas: 

•  Requirement  of  a  standardized  calling  sequence,  providing  for 
a  standard  interface  between  all  levels. 

•  Specification  of  the  data  structures  that  the  protocol  module 
is  allowed  to  modify.  Omission  of  a  list  implies  that  no  data 
structures  may  lie  modified  by  a  module  supporting  this 
protocol  area. 

•  A  list  ot  the  other  level  3  modules  it  can  activate. 

•  Actions  a  module  can  perform. 

An  example  of  a  capability  descriptor  is  presented  in  detail  in  Schneider's 
paper. 

The  last  portion  of  VANS  is  a  report  generator.  This  is  a  standard 
post-processor,  which  prints  out  a  list  of  statistics  in  the  following  five 
categories : 

1.  Message  Statistics  —  total  traffic  by  links,  HOST-HOST, 
communications  processor  to  communications  processor. 

l  ime  Kffleiency  Statistics  —  average  message  delays  by 
priority  classes,  HOST -HOST,  communications  processor 
■  communications  processor. 


i't  si  Statistics  of  Various  Resources  —  lines,  processors,  etc. 


hollowing  this  phase,  tho  user  has  tin*  rip.ht  to  reenter  any  preeedinn 
sta^o,  Hiv  ing  him  t ho  opportunitv  to  experiment  further  vvllh  both  tho  physieal 
network  ami  Ilu1  protoeol  slrudinvu. 

t'learh  ,  a  numl'or  ot  faseinafinp.  experiments  max  l>e  eonduotod  with 
this  model.  Sohnoider's  menu  ineludos- 

•  A  study  ot  dvnamio  routing  algorithms. 

•  \  studv  ot  How  eont  rol  ami  deadlook  prevention  algorithms. 

•  \  studv  ol  torvvard  error  oor root  ion  versus  rot  vanstnission 
sehomes  under  different  renditions. 

•  A  studv  ol  mossar.e  versus  p.aeket  svvitehinp  under 
ditto vowt  oondtt  ions . 

•  Kinallv  tatul  most  ambitiously!,  a  possible  utilization  ot 
\  A  NS  as  part  of  a  topolopteal  design  protoet  for  the 
oonst nii't ion  of  oomputor  networks. 

VANS  is.  in  tho  opinion  of  tho  wrifor,  an  exelttt\>;  model  that  shows  the 
potential  for  breaking  new  ground  in  network  design.  As  sueh,  its  further 
progress  should  e«'rtainl\  be  walehod. 


s:’ 


nimv  AssoiMA  n:s  -  npms.  anpms 


Clone  ml 

I'MW  Associates  markets  ;i  series  of  models  directed  towards  the 
design  and  management  of  centralized  networks.  1'ho  models  are  interactive 
in  nature,  and  a  re  available  through  a  national  time-sharing  service 
(TYMS1IAKK1.  The  company  will  also  install  them  on  the  user's  computer 
facilities. 

I’r.  Pixon  Poll,  the  president  of  the  eotnpam  ,  declined  pvith  one 
exeeptionl  to  give  any  information  on  the  algorithmic  structure  of  his 
company's  models,  lie  did,  however,  explain  that  they  were  "Ivittle-tested.  " 
Peseriptive  material  of  a  promotional  nature  was  also  supplied. 

In  view  of  this  lack  of  information,  no  analysis,  however  tentative,  is 
possible,  lienee,  wo  merely  present  a  functional  description  of  the  models, 
and  conclude  with  a  few  general  comments  on  the  product  and  its  applicability 
to  problems. 

I'nnol  ional  1  Vse  rip1  ion 

Figure  :’S  is  a  diagram,  taken  from  the  literature  supplied  by  PMW 
[PMW7S]  ,  which  summarizes  the  major  modules  in  PMW's  package  as  well  as 
their  functions.  ihe  discussiott  that  follows  refers  to  this  diagram. 

Vhe  moo. ile  termed  HKSl'ONSF  I  kasieallv  generates  a  lino-loading 
table,  given  a  response-lime  goal,  thus  permitting  the  user  to  compare  dif¬ 
ferent  line  disciplines,  message  priorities,  etc. 

11  IS  I  \  'NSI '  computi's  Ihe  response  time  for  a  line  that  has  already 
U'en  configured;  that  is.  its  traffic,  numU'r  of  terminals,  etc.  has  already 
U'en  determined.  As  such,  its  primarv  use  is  to  estimate  response  time 
once  the  network  has  been  designed  In  the  PKSU1N  module  of  NPMS. 


Network  Desis?n  and  Analysis  Models 


NDMS  (Network  Design  and  Management  System)  solves  (lie 
concentrator-assignment  and  line-la\out  problem.  Dr.  Doll  mentioned  that 
the  Esau-Williams  algorithm  is  at  the  heart  of  this  module.  A  number  of 
options  for  the  use  of  this  module  are  available.  For  example,  the  user  can 
perform  his  own  concentrator  placement  and  rely  on  a  subset  of  NDMS  to 
perform  the  line-layout  activity.  He  may  also  choose  to  develop  his  own 
line-loading  tables  instead  of  relying  on  RESPONSE  1.  ANDMS  (Advanced 
Network  Design  and  Management  System)  is,  as  is  evident  from  figure  28, 
an  extension  of  NDMS.  In  addition  to  NDMS,  ANDMS  contains  a  data-base 
management  subsystem,  DBMS,  that  can  be  used  to  keep  track  of  and  con¬ 
trol  user  locations  and  facilities  (modems,  terminals,  etc.).  This  informa¬ 
tion  may  then  lie  passed  on  to  the  DESIGN  module  as  well  as  to  the  ANALYSIS 
module.  Temporary  data  bases  may  also  Ix>  created,  merged,  and  stored. 

The  analysis  module  (aptly  named  ANALYSIS)  is  used  for  determining 
response  time  once  the  DESIGN  module  has  completed  its  activities.  It  is 
not  clear  how  this  submodule  differs  from  RESPONSE  2. 

In  addition  to  the  above  network  design  products,  DM\Y  provides  other 
management  tools  for  communieations-related  expenses. 

TELE  COST  provides  reports  on  telephone  utilization  and  costs.  For 
example,  one  may  obtain  detailed  summaries  of  long-distance  phone  calls, 
trunk  group  usage  (WATS,  FX,  Direct  Dial,  etc. ),  lists  of  frequently  called 
numbers,  etc. 

WATSTOLL  provides  information  on  the  costs  and  utilization  of  WATS 
lines,  and  also  determines  an  optimal  mix  of  tariffs  and  facilities. 

( >nl i ne  Tariff-Guide  is  an  interactive  program  for  determining  the  price 
schedule  for  With  point-to-point  and  multipoint  circuits.  It  contains  an  exten¬ 
sive  directory  of  common-carrier  tariffs. 


Summary 


As  no  algorithmic  information  was  made  available  (with  the  exception 
of  the  DESIGN  module),  not  much  can  bo  said  about  DMW's  package.  It 
evidently  suffers,  however,  front  the  lack  of  a  distributed  network  design 
capability  and  reliability  capabilities. 

Given  Dr.  Doll's  association  with  IBM's  Systems  Research  Institute, 
and  his  use  of  the  Esau-Williams  algorithm  (which  constitutes  an  important 
building  block  of  C'NDP),  one  may  reasonably  speculate  that  the  remainder 
of  his  package  is  also  tailored  along  the  lines  of  CNDP. 


NETWORK  ANALYSIS  CORPORATION  (NAC)  -  GRINDER,  MIND 
General 


NAC  has  developed  tools  for  the  design  of  centralized  systems  (MIND), 
as  well  as  distributed  systems  (GRINDER)  —  it  is  the  only  vendor  that  sup¬ 
plies  a  package  capable  of  designing  distributed  networks.  Reliability  models 
are  included  with  both  GRINDER  and  MIND.  Again,  NAC  is  alone  among  the 
vendors  in  offering  reliability  models,  a  feature  clearly  of  great  importance 
in  military  circumstances.  Both  of  these  tools  consist  of  a  series  of  modules, 
which  are  written  in  FORTRAN,  and  are  interactive  in  nature.  The  modules 
can  also  be  utilized  in  a  batch  mode.  They  are  available  through  a  national 
time-sharing  service  or  can  be  installed  on  the  user's  computer. 

Functional  Description 

Figure  29  (reprinted  from  the  literature  supplied  by  NAC)  presents 
a  view  of  the  various  modules  in  GRINDER.  Note  that  it  clearly  illustrates 
the  relationship  that  MIND  bears  to  GRINDER.  The  functions  of  each  of  the 
boxes  in  the  diagram  will  lie  discussed  first. 

Input  Module  and  Data  Base 

The  data  base  has  such  information  as  tariff  structure,  device  char¬ 
acteristics,  etc.  The  user  can  interact  with  this  data  base  by  fetching  and 
modifying  those  portions  which  are  relevant  to  his  problem.  He  can  also 
create  his  own  files  as  part  of  the  data  base. 

Network  Configuration  and  Hardware  Alternatives  Modules 

Each  of  these  modules  require  inputs  consisting  of  various  device  and 
traffic  characteristics. 
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CPI’  Configuration.  This  module  evaluates  the  cost-effectiveness  of 
a  particular  central  site  configuration  (i.  e.  ,  a  combination  of  central 
processor(s)  and  peripheral  devices).  Its  outputs  include  system  bottle¬ 
necks,  throughput  and  cost  for  the  various  device  characteristics,  as  well 
as  for  different  distributions  of  CPU  processing  time,  and  different  message 
arrival  rates. 

1  enninal  Configuration.  This  module  evaluates  the  cost-effectiveness 
and  throughput  capabilities  of  different  terminal  configurations.  One  topic  of 
interest  here  is  the  use  of  intelligent  terminals  versus  slave  terminals 
employed  with  line  controllers.  The  number  of  terminals  for  a  given  through¬ 
put  and  the  cost  of  that  throughput  are  the  basic  outputs  of  this  module. 

Number  and  Location  of  Multiplexors  and  Concentrators.  This  module 
portrays  the  effect  of  each  of  these  devices  on  the  network's  response  time 
as  well  as  its  reliability. 

Network  Simulation  Module 

This  module,  along  with  its  "distributed"  companion,  performs  the 
response  time  throughput  analysis  for  the  network.  It  is  a  hybrid  model  in 
that  it  is  a  simulation,  with  analytic  models  employed  at  the  device  level. 

The  parametric  input  to  this  module  includes  message  length  distribu¬ 
tions,  interval  time  distributions,  number  of  terminals  per  line,  polling  and 
addressing  sequence  lengths,  device  turnaround  times,  etc.  The  output  of 
the  module  can  be  presented  in  the  form  of  an  overall  response  time,  or 
waiting  times  in  various  devices.  Response  time  as  a  function  of  throughput 
is  also  an  option.  This  module  is  used  to  produce  line-loading  tables  as  an 
input  for  the  topological  design  modules.  Various  other  options  for  input  and 
output  are  also  available. 
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Reliability  Module 

The  function  of  this  module,  as  clearly  implied  by  the  name,  is  to 
calculate  various  measures  of  network  reliability. 

The  module  requires  as  inputs  a  given  topological  design  as  well  as 
device  failure  rates  (in  the  form  of  mean  time  to  failure,  mean  time  to 
recovery,  probability  of  failure,  etc.). 

The  output  can  be  presented  in  the  form  of  'me  fraction  of  node  pairs 
that  are  not  able  to  communicate. 

Programs  for  calculating  the  reliability  of  "standard"  network  con¬ 
figurations,  such  as  trees,  loops,  series  parallel  networks,  etc.  ,  have  been 
written.  Since  networks  are  often  a  combination  of  the  above  topologies 
(as  well  as  a  backbone),  the  above  programs  may  often  be  combined. 

Topological  Design  Modules 

There  are  two  design  modules  in  the  package.  One  corresponds  to  a 
centralized  network,  while  the  other  corresponds  to  a  distributed  network. 

The  centralized  design  module  addresses  the  problems  of  clustering  the 
terminals,  concentrator /multiplexor  placement,  line-layout  and  capacity 
allocation.  The  distributed  design  module,  in  addition  to  solving  the  same 
problems  as  the  centralized  design  module,  addresses  the  routing  issue  as 
well  as  the  backbone  design  problem  for  large  networks. 

Program  Manager 

This  module  has  two  functions.  It  allows  information  to  be  passed 
back  and  forth  in  between  modules.  For  example,  the  results  of  the  response 
time  module  can  be  passed  on  to  the  design  module,  and  vice  versa,  thus 
allowing  an  interaction  to  occur  that  can  successively  produce  more  effective 
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designs.  The  Program  Manager  module  also  produces  a  variety  of  designs 
under  different  assumptions.  Thus  for  a  variety  of  different  traffic  levels, 
one  can  obtain  corresponding  designs  and  the  costs  for  different  response- 
time  /throughput  requirements. 

Algorithmic  Structure 

We  divide  our  description  of  NAC's  model  into  three  parts.  The  first 
part  deals  with  three  modules  common  to  the  design  of  both  centralized  and 
distributed  networks  —  the  reliability  and  CPU  configuration  modules.  The 
second  part  deals  with  the  design  of  centralized  networks;  and  the  last  section 
presents  the  design  of  distributed  networks,  including  a  discussion  of  the 
backbone  design  problem  for  large  distributed  networks. 

Models  Common  to  Roth  Centralized  and  Distributed  Network  Design 

CPU  Configuration  Module.  This  module  evaluates  the  utility  of  vari¬ 
ous  hardware  configurations.  It  is  applicable  to  both  central  site  configura¬ 
tions  and  concentrators.  The  module  essentially  determines  the  following- 
characteristics  for  different  "conditions"  —  capacity,  response  time, 
throughput,  and  the  location  of  processing  bottlenecks.  The  conditions  are 
the  following  design  variables; 

•  Hardware  configurations 

•  Software  design  (i.c.  ,  processing  cycles  of  transactions) 

•  Traffic  characteristics,  both  demand  and  service 

The  model  employed  is  based  upon  a  paper  by  Chou  and  McGregor 
(CHOU751  ,  entitled  "A  Unified  Simulation  Model  for  Communication 
Processors."  In  this  paper,  a  methodology  for  constructing  such  a  simula¬ 
tion  is  developed  by  employing  a  queueing  model  of  a  communications 
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processor,  and  then  basing  a  discrete  event-stepped  simulation  on  this 
model.  This  hybrid  approach  combines  the  advantages  (and,  one  hopes, 
eliminates  the  disadvantages)  of  the  pure  analytic  and  the  pure  simulation 
approach  to  the  problem.  The  advantage  of  simulating  a  queueing  model  as 
opposed  to  the  actual  system  is  the  gain  in  speed  of  the  simulation  without 
an  attendant  loss  of  accuracy.  This  is  unfortunately  too  often  the  case  when 
one  relies  strictly  upon  queueing  models. 

The  model  itself,  shown  in  figure  30,  is  a  collection  of  interconnected 
queueing  structures.  There  are  four  servers  (each  possessing  its  own 
queue)  —  the  CPI',  the  peripheral  devices,  the  I/O  channels  associated  with 
the  peripheral  devices,  and  the  communications  lines. 

The  CPI'  is  modeled  as  a  single-server  queue.  Inputs  may  come  from 
the  communication  lines,  the  peripheral  devices,  or  the  CPU  proper. 

Outputs  are  directed  to  these  same  devices.  The  service  time  for  the  CPU 
is  considered  to  be  the  processing  time. 

The  peripheral  devices  are  modeled  as  a  collection  of  single-server 
queues.  In  case  of  a  disk,  transactions  fir.  t  queue-up  for  access  to  the 
desired  disk,  and  are  then  serviced  provided  that  no  other  transaction  is 
making  use  of  the  disk.  The  service  time  is  the  disk-seeking  time.  After 
completion  of  this  activity,  a  queue  is  formed  "in  front  of"  the  I/O  channel 
of  the  disc  controller. 

The  I/O  channel  of  a  device  controller  is  portrayed  as  being  shared  by 
a  number  of  devices,  while  the  controller  governs  the  activity  of  one  device 
at  a  time.  The  serv  ice  time  is  the  read /write  time. 

The  communication  lines  are  modeled  as  a  collection  of  single¬ 
server  queues. 
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del  of  a  Communications  Processor 


Kaeh  transaction  is  viewed  as  proceeding  from  one  server  to  another 
as  it  progresses  through  the  computer.  Multiple  entries  to  any  device,  as 
well  as  simultaneous  service  ot  one  transaction  by  a  number  of  devices,  are 
included  as  part  of  the  model. 

The  simulation  was  written,  as  previously  indicated,  to  be  sensitive 
to  changes  in  hardware,  the  transaction  processing  cycle,  and  traffic  char¬ 
acteristics.  Hence,  a  hierarchy  of  levels  corresponding  to  the  above  three 
system  characteristics  was  dev  ised  as  the  basis  tor  implementing  the  program. 

The  hardware  configuration  level  occupies  the  highest  rank  of  the 
hierarchy.  It  is  implemented  as  a  table  ot  hardware  devices,  and  associated 
transaction  times  for  each  dev  ice. 

The  software  design,  or  transaction  processing  cycle  is  represented  by 
a  state  vector  corresponding  to  each  facility  (server),  and  a  cycle  vector 
corresponding  to  each  transaction  type.  The  state  vector  associated  with 
each  device  characterizes  its  state  by  an  interrupt  time  distribution,  a  service 
time  distribution,  am  a  priority.  The  cycle  vector  in  turn  specifies  the 
sequence  of  servers  as  well  as  their  states.  These  two  vectors  provide  the 
linkage  between  the  various  dev  ices. 

The  demand  service  characteristics  are  specified  via  average  message 
lengths  and  message  length  distributions,  and  average  message  arrival  rates 
and  inter-arrival  distributions. 

The  interested  reader  should  consult  CllOl'75  for  more  details;  another 
paper  of  interest  on  this  specific  approach  [  MCGR75a  1  might  also  be  consulted. 


UcltabtUty  Modulo.  In  this  discussion,  we  focus  on  ;in  analysis  model 
for  network  reliability.  Our  discussion  is  based  upon  papers  by  Van  Slyko 
ami  Frank  [VANS?'.!]  and  Frank  and  Chou  |  FKANT1?  1  . 

i'h.e  measure  of  reliability  that  is  employed  in  this  model  is  the  average 
fraction  of  node  pairs  that  cannot  communicate  U'cause  of  node  and  link 
failures.  W  ith  this  definition  in  mind,  we  formulate  the  analysis  problem  in 
terms  ot  the  probability  of  the  network  U'ing  disconnected.  Specifically , 
given  a  network  with  N  nodes  and  A  arcs,  as  well  as  a  probability  pot 
failure  for  each  of  these  elements,  estimate  the  probability  h< p>  ot  the  net¬ 
work  U'ing  disconnected  as  a  result  of  component  failures  for  P  values  of 
the  probabflitN  p.  One  may  also  wish  to  estimate  the  expected  number  ol 
node  pairs,  tup).  not  able  to  communicate  for  1’  values  of  p. 

Two  approaches  are  presented,  the  functional-simulation  approach, 
and  the  Moo  re-Shannon  approach,  both  approaches  Ivnefit  trom  an  algorithm 
that  is  used  to  calculate  the  number  of  components  in  a  graph  with  N  nodes 
and  A  arcs.  This  algorithm  operates  by  successively  adding  one  branch  at  a 
time  to  the  network,  and  then  calculating  the  number  ot  components  in  the 
resultant  graph. 

F.mploying  the  notation  A  for  the  graph  formed  upon  the  addition  of 

k 

link  a  to  the  graph  a  ,  we  now  present  the  algorithm: 

k  k  - 1 

Step  0  Initialization 

l,et  A  0,  and  assign  a  separate  laU-l  to  each  node. 

Set  k  0  (k  is  used  to  keep  track  of  the  number  of  nodes  added  1. 

Network  reliability  (survivability)  is  a  major  subject  in  its  own  right,  with 
many  results  scattered  throughout  the  literature.  A  comprehensive  survey 
of  the  results  is  contained  in  a  paper  by  Frank  and  Frisch  f  FHANTOa  |  . 


Stop  I  Add  link  to  A^.  If  a^  -  (n^,  n,),  then  examine  the  labels 
on  iv  and  n..  If  they  are  different,  no  to  stop  2.  Otherwise, 
set  k  k  *  1,  and  repeat  step  l . 

Step  2  Change  all  of  the  node  labels  that  are  the  same  as  the  label  of 

n,  to  the  laltel  of  n  .  Set  k  k  •  1,  and  cheek  to  see  If 

A  =  A  (i.e.  ,  if  all  the  arcs  have  Ix'en  added).  If  so,  STOP, 

K  *  1 

Otherwise,  go  to  step  1. 

As  one  proceeds  through  the  algorithm,  one  may  keep  statistics  as  to 
the  number  of  communicating  node  pairs,  the  number  of  nodes  in  each 
component ,  etc. 

We  now  proceed  to  a  discussion  of  the  two  simulation  approaches 
employed  in  NAt"s  reliability  models. 

1.  The  Functional  Simulation  Method.  This  suggestive  name  is 
employed  because  in  this  method,  the  expected  number  of  node  pairs  com¬ 
municating  with  one  another  are  calculated  as  a  function  id'  the  prolvtbility  of 
link  failure.  Based  upon  work  by  Hammersly  and  llandseomb  [1IAMM(>4|  , 
the  method  is  as  follows. 

For  each  link,  a  random  number  between  0  and  1  is  generated,  and 
they  are  then  listed  in  decreasing  order.  As  is  usual  in  Monte  Carlo  simu¬ 
lations,  if  p  is  larger  than  a  random  number,  then  the  link  associated  with 
that  random  number  is  taken  out  of  the  network,  while  if  p  is  smaller  than 
the  number,  the  link  is  left  in,  and  the  resulting  network  is  analyzed  for  its 
connectivity.  Hence,  for  the  given  sequence  of  random  numbers 

r.  >  r  ,>...>  r  ,  if  p  >  r  all  the  links  are  removed.  If  r  >  p  >  i* 
12  na  1  12 

then  only  the  first  link  remains,  and  in  general,  if  r^  >  p  >  r,  ^ ,  then  the 
network  having  links  n^,  ...  ,  a^  is  examined  for  connectivity. 


By  us  inn  the  connectivity  algorithm  just  described,  however,  the 
network's  connectivity  can  lx>  evaluated  for  all  P  values  of  p  in  one  pass, 
thus  saving  a  considerable  amount  of  computation  time.  Phis  is  seen  as 
follows:  first,  order  the  collection  of  probabilities  and  the  random  numbers 
generated  in  one  sequence,  e.  g.  ,  r^  <  p]  r  (  ...  p  <  r  .  Then, 

adding  one  link  at  a  time  as  permitted  by  the  above  algorithm,  analyze  the 
resulting  networks  for  connectivity,  keeping  track  of  the  appropriate  sta¬ 
tistics  (o.  g.  ,  node  pairs  which  can  communicate,  etc.),  if  so  desired. 

“•  Moo  re -Shannon  Approach.  1  last'd  on  a  paper  by  Moore  ami  Shannon 
[M(X)H5(>]  ,  this  method  estimates  the  probability  of  the  network  tx'ing  dis¬ 
connected,  h(p),  via  the  following  formula: 

N-A 

,,  .  ,  N  i  A-k  k 

h(p)  =  y  c(k>  p  cl 

k  -() 


where  q  =  1  -  p  and  ctk)  is  the  number  of  networks  having  k  elements  t  i.e.  , 
nodes  and  arcs)  which  are  disconnected. 

A  similar  tormula  is  employed  to  estimate  the  number  of  node-pairs 
that  are  not  able  to  communicate: 

N  i  A 

IMP)  =  D(k)  pN  A~k  qk 
k-0 


where  D(k)  is  the  number  of  node  pairs  unable  to  communicate,  given  that 
there  are  k  arcs  in  the  network. 
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In  evaluating  the  formula  for  h(p),  a  simple  combinatorial  result  is 
brought  to  bear.  If  c  is  the  minimum  number  of  arcs  required  to  disconnect 
the  network,  then  there  are  only  A  -  c  *  2  non-trivial  terms  in  the  expansion. 
This  is  so  because: 

1.  If  the  network  has  n  nodes,  it  takes  at  least  n  -  1  arcs  to 
connect  them.  Hence  c(k)  (  when  k  =  0,  1,  ...  ,  n  -  2. 

2.  If  c  is  the  minimum  number  of  links  that  must  be  deleted  to 
disconnect  the  network  (i.e.  ,  the  minimum  cut-set),  then 
C(NA  -  k)  =  0  for  k  =  0,  1 ,  .  .  .  ,  e  -  1 . 

In  addition  to  this,  if  is  sufficiently  small,  then  c(k)  can  l>e 

calculated  by  enumeration  —  if  not,  they  may  be  estimated  by  simulation. 

In  the  event  that  they  are  estimated  via  simulation,  the  theory  of  stratified 
random  sampling  is  applied.  A  discussion  of  this  technique  may  be  found  in 
FISZ63. 

The  algorithm  for  computing  the  number  of  components  ol  a  graph  can 
also  be  applied  here. 

Centralized  Network  Design 

Two  of  the  fundamental  questions  in  the  design  of  centralized  networks 
have  been  discussed  in  section  2  —  the  location  of  concentrators  (multiplexors) 
and  the  line-layout  problem.  NAC's  approach  to  the  line-layout  problem  — 
the  I'nified  Heuristic  Algorithm,  due  to  Kershenbaun  and  Chou  —  has  been 
discussed  in  that  section.  Hence,  the  main  topic  of  this  section  falls  under 
the  following  heading. 
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Network  Parlitioning/Concenlrator  Location  Module 

This  algorithm  is  referred  to  as  the  COM  (center-of-mass)  algorithm. 
It  functions  as  a  local-access  designer  by  first  partitioning  the  users  in  the 
network,  then  assigning  them  to  local  access  facilities  (i.e.  ,  concentrators 
or  multiplexors),  and  finally  tying  the  users  via  their  local  access  facility 
to  a  central  computer. 

employing  NA(  's  own  terminology,  a  resource  connection  point  will  be 
called  a  RKSCOP,  while  a  local  access  facility  will  be  referred  to  as  a 
I  A  COM  (the  Til'  and  ANTS  of  ARPANKT  fame,  concentrator  or  multiplexor). 
The  choice  of  this  terminology  reflects  the  generic  nature  of  the  access 
location  problem. 

The  access  location  problem  may  be  formulated  as  follows. 

Given  : 

•  A  set  of  nodes 

•  A  particular  node  that  serves  as  a  RKSCOP 

•  Constraint  on  the  number  of  nodes  that  may  share  a  line 

•  Constraint  on  the  number  of  nodes  Hint  a  TACOM  may  serve 

•  Cost  of  connecting  nodes 

•  Cost  of  TACOM 

Kind:  A  low-cost  feasible  design  employing  TACOMS 

Our  description  of  the  algorithm  follows  the  one  given  by  NAC.  We 
tirst  present  a  brief  summary  of  the  algorithm,  and  then  expand  the  discus¬ 
sion  to  Include  some  of  the  details.  The  interested  reader  is  referred  to 
MCGRTob. 
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The  algorithm  may  be  outlined  in  four  steps,  as  follows. 

1.  Reduce  the  problem  to  a  point-to-point  problem  by  replacing 
clusters  of  nodes  by  single  eenter-of-mass  (COM)  nodes. 

(Hence  (lie  title  of  the  algorithm. ) 

-•  Apply  an  add  algorithm  to  the  resulting  collection  of  COM 
sites,  resulting  in  a  placement  of  TACO  MS  at  several 
COM  sites. 

3.  l  ocate  the  chosen  TACOMS  at  real  nixies  by  choosing  the 
original  nodes  closest  to  the  designated  COM  nodes. 

1.  Apply  a  line-layout  algorithm  to  each  partition,  utilizing 
the  TACOM  as  the  central  node. 

A  simplified  flow  ehart  taken  from  MCGR75b  is  presented  in 
figure  Ml,  and  a  series  of  diagrams  depicting  the  outcomes  of  the  four  steps 
is  presented  in  figures  32  through  :if>.  In  the  following  expanded  discussions 
of  each  of  the  four  steps,  numerals  in  parentheses  refer  to  the  flow  ehart 
(figure  .11). 

Step  1 :  clustering.  Clusters  of  nodes  are  formed  by  "snowballing" 
nodes  together.  The  two  nodes  closest  together  are  selected  (2),  and  line¬ 
loading  constraints  are  checked  (1).  If  they  were  not  violated,  the  two  nodes 
are  replaced  by  a  new  nolle  at  the  eenter-of-mass  of  the  original  two  nodes  (4). 
1  he  nodes  start  with  a  weight  of  one.  The  weight  then  becomes  the  number 
of  nodes  in  the  cluster.  Nodes  that  cannot  be  merged  are  removed  from 
consideration.  1  he  "snowballing"  continues  until  all  nodes  belong  to  their 
own  cluster,  as  portrayed  in  figure  12. 
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I  ipiut'  .U  (concluded).  Flow  Chart  of  COM  Algorithm 
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Slop:*:  Partitioning.  \n  add  algorithm  is  applied  to  the  COM  sites  in 
order  to  loeate  1  ACOMs  , 7 > .  \fter  choosing  eaeh  sueeessive  TACOM  site, 
an  heuristic  algorithm  incorporating  the  weight  of  the  nodes  is  applied, 
resulting  in  a  possiblv  new  choice  for  the  TACOM.  II  the  choice  is  cost- 
effective  i M ,  the  1'ACOM  is  kept ,  and  the  process  continues  p«).  If  not,  all 
the  remaining  COM  nodes  are  directly  connected  to  the  central  site  ^  1  . 

The  output  of  this  step  is  shown  in  tigure  :>:i. 

Step  a:  l  ocal  t >pt imi/at ion.  Step  results  in  the  selection  of  a  par¬ 
ticular  COM  site  as  a  1'ACOM  location.  because  this  site  can  easily  he  an 
idealized  site  —  i.e.  ,  not  an  actual  node  —  it  is  neeessarv  to  locate  the 
TACOM  at  an  actual  nixie,  To  accomplish  this,  several  of  the  nodes  closest 
to  the  ideal  TACOM  site  are  evaluated  (101,  and  the  most  cost-effective  is 
chosen.  This  step  is  depicted  in  figure  li  t. 

Step  1 :  1  inc  1  avont .  At  this  stage  a  line-lav  out  algorithm  (i\  g.  ,  unified 
Algorithm)  is  applied  to  the  given  partition  (Mi.  A  cheek  is  made  to  see  if 
am  nodes  remain  (1121.  It  am  remain,  the  add  algorithm  of  step  L’  is  applied 
again.  It  not,  the  design  is  finished,  the  result  of  the  fourth,  and  final  step 
is  depicted  in  figure  ;if>. 

Distributed  Network  lVsign 

As  mentioned  in  section  2  of  this  paper,  NAC’s  model  for  the  design  of 
distributed  networks  is  the  cut-saturation  algorithm.  An  outline  of  the 
algorithm  was  presented  in  that  section,  leaving  us  with  only  two  remaining 
tasks.  These  are  a  discussion  ot  the  flow  assignment  tor  routing!  algorithm 
which  is  embedded  in  the  cut-saturation  algorithm,  and  a  discussion  of  the 
backlxme  design  algorithm  at  the  heart  of  large  network  designs. 


An  efficient  implementation  of  the  cut-saturation  algorithm  is  dependent 
upon  a  fast-routing  algorithm.  The  algorithm  developed  is  essentially 
an  outgrowth  of  the  Flow-Deviation  algorithm  which  was  discussed  in  section  2. 
As  was  noted,  the  developers  of  the  Extremal  Flows  algorithm  have  essen¬ 
tially  employed  a  gradient-projection  method  to  speed  up  its  rate  of  conver¬ 
gence,  and  thus  make  it  more  amenable  for  its  inclusion  in  the  cut-saturation 
algorithm. 

Flow  Assignment.  The  algorithm  for  flow  assignment  is  based  on 
a  paper  by  Cantor  and  Gerla  f  CANT 7-1 1  ,  and  is  called,  appropriately  enough, 
the  Extremal  Flows  (FF)  algorithm. 

We  should  remind  the  reader  at  this  juncture  of  some  remarks  made  in 
section  2  concerning  the  nature  of  routing  algorithms  employed  during  the 
design  phase  of  a  network.  Of  necessity,  these  algorithms  provide  a  fixed 
routing  policy  ii.e.  ,  one  that  does  not  vary  with  time).  Because  of  the  inter¬ 
dependence  of  routing  with  other  design  considerations,  we  are  in  need  of 
a  routing  policy,  and  so  we  settle  for  one  that  will  provide  us  with  a  minimum 
average  time  delay  design. 

Cantor  and  Gerla  approach  the  problem  via  mathematical  programming. 
The  solution  is  exact  in  that  a  global  optimum  may  be  achieved. 

The  EF  algorithm  is  essentially  an  improved  version  of  the  Flow- 
Deviation  algorithm  discussed  in  section  2.  The  improvement  is  to  be  found 
in  the  use  of  a  gradient-projection  method  to  speed  up  the  rate  of  convergence 
of  the  algorithm.  This  is  significant,  because  an  efficient  implementation  of 
the  cut-saturation  method  is  dependent  upon  a  fast  routing  algorithm. 


The  problem,  as  formulated  by  Cantor  and  Gerla,  is  as  follows: 

C.iven:  A  directed  network  of  NN  nodes  and  NA  arcs,  and 

nnxnn  requirements  matrix  R  =  [r..j  whose  entries 
consist  of  flows  between  nodes  i  and  j, 

* 

Minimize  over  f  (network  flow  vector) 


T  = 


y 


NA 


subject  to  the  following  constraints : 

•  f  is  a  multicommodity  flow  satisfying  the  requirements 
matrix  R 

•  f  ^  C,  where  the  equation  C  =  (C  ,  ...  ,  C^)  represents 
the  capacity  constraints  on  the  flows. 


Cantor  and  Gerla  first  note  that  the  expression  for  T  can  be  reformu¬ 
lated  in  terms  of  shortest  route  flows.  They  do  this  by  noting  that  the  extreme 
points  of  the  set  of  feasible  flows  can  be  shown  to  be  shortest  route  flows. 

This  conclusion  is  derived  from  the  following  two  facts:  (1)  f  is  a  multi- 
commodity  flow  satisfying  a  requirements  matrix,  and  (2)  the  collection  of 


This  expression  corresponds  to  the  one  given  for  T  in  section  2  as  follows. 
First  assume  that  p’  =  p,  and  set  k  and  P  equal  to  zero  (that  is,  ignore 

NA  A.  /  .  \ 

them).  We  obtain  T  =  /  .  — 


i=l 


ipC.  -  X. 
\1  1/ 


,  then  divide  top  and  bottom  by  p, 


obtaining  the  above  expression. 
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multicommodity  flows  satisfying  the  requirements  matrix  R  is  a  closed 
and  bounded  convex  polyhedron. 

Cantor  and  Gerla  refer  to  shortest  route  flows  as  extremal  flows  and 

represent  f  as  a  convex  combination  of  a  collection  of  extremal  flows, 
i 

<f>  ,  i  =  1,  . . .  ,  r. 


r 


i-  1 


where  \^  =  1 , 
i=l 


0,  i  -  1, 


r 


Figure  3<>  is  a  geometrical  representation  of  this  situation. 


0  13)  0  (4) 


Hast-J  mi  CAN  174.  j<  I(>(i4. 

Figure  ;K>.  Feasible  Region  of  Flows  and  Extremal  Points 
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i  8f. 


h  - 


0  T 

rj f  can  ***  thought  of  as  the  increase  in  time-delay,  given  an  infinitesimal 
i 

increase  in  the  flow  f.  on  channel  i.  Having  thus  defined  1  we  are  in  a 
position  to  define  (lie  quantities  which  are  instrumental  in  stating  the 
Kulm- Tucker  theorem. 


E  *,*? 


One  may  interpret  p^  as  the  increment  in  T,  given  an  infinitesimal 
increment  in  V or  alternatively,  as  the  cost  of  the  extremal  flow  cf>k, 
given  a  cost  of  1.  per  infinitesimal  increase  in  the  flow  on  arc  i 


The  authors  then  point  out  that  at  most  NA  +  1  extremal  flows  are 
necessary  (i.e.  ,  r  s  NA  t  1,  where  NA  is  the  number  of  arcs).  This  turns 
out  to  have  significant  implications  for  the  algorithm  proper,  because 
a  pivoting  operation  may  be  employed  to  reduce  the  extraneous  flows. 

In  order  to  determine  conditions  for  an  optimal  feasible  flow, 
t  antor  and  Gerla  invoke  the  Kuhn-Tueker  conditions  for  optimality.  Ik' fore 
stating  them,  we  offer  some  preliminary  remarks. 
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WUh  t  ho  so  definitions  out  of  the  wav,  the  Kuhn-Tuoker  conditions  may 
lie  written  as  : 


^k  = 


>  f' 


0  ...  \>0 

it 


0 


\  ° 


for  each  k 


rtio  interpretation  ot  these  conditions  is  that  essentially  all  non-zero  extremal 
Oows  must  have  the  same  cost  p  (i.e.  ,  the  same  incremental  delay),  and 
that  any  extremal  flow  with  a  cost  greater  than  p  must  be  zero.  After  some 
further  manipulation,  the  authors  point  out  that  the  conditions  are  equivalent 
to  tlu'  statement  that  f  is  optimal  if  and  only  if  there  exists  no  flow  w  ith 
a  smaller  delay  increment.  Since  one  can  verify  this  with  a  shortest-route 
calculation,  a  valuable  bit  of  information  has  been  obtained. 

We  now  present  the  Extremal  Flows  algorithm,  which  uses  a  shortest- 
distancc  algorithm  to  generate  extremal  flows,  and  then  employs  a  gradient- 
projection  algorithm  to  minimize  the  time  delay  T  over  the  set  of 
extremal  flows. 


Step  0 


Initialization 

Let  b  =  NA  •  l ,  the  number  of  current  extremal  flows 
_  /  1,0  b,0\ 

•r  =  Ic')  ,  . .  .  ,  c*>  )  ,  the  initial  set  of  extremal  flows 

0  /  0  0  \ 

\  f  A  j ,  ...  ,  X,  |  ,  the  initial  basic  solution 
b 


the  initial  feasible  flow 


n  =  1 
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Step  1 


Gradient  Projection 


The  gradient,*  defined  by  V^T  =  (jj . ),  is  projected 

on  the  hyper-plane  X^A.-  1  0.  Hy  setting  the  components  of 

V^T  which  point  in  unfeasible  directions  equal  to  zero, 
a  feasible  downhill  direction  (-V  T')  is  obtained.  The  following 
minimization  is  performed  at  each  iteration: 


where 


<v  -  r 

/(  V  T’  ) 

if  /v  ’ P 

)  >  °’ 

i  = 

i  / 

\  A  /, 

A 

1 

\ 

b 


The  two  steps  are  repeated  until  |  |  V,  T*  |  |  =  0,  at  which  point 

A 

tlie  Kuhn-Tucker  optimality  conditions  are  satisfied. 

Step  2  Pivot  Step 

The  purpose  of  this  step  is  to  eliminate  extraneous  basic 
solutions  in  preparation  for  t he  next  step. 

II  b  NA  *  1 ,  let  'f>  ■»  iji1'  t 

and  go  to  stop  .!  It  1>  NA  •  2,  eliminate  one  of  the  extremal 
flows  with  a  pivot  operation. 


The  reader  who  is  not  conversant  with  mathematical  programming  might  well 
wish  to  consult  11  ADI, (14  for  a  general  discussion  of  the  gradient - 
projection  method. 

Ill 


Step 


Step  1 


Step  3 


Shortest -Route  Computation 

Employing  any  efficient  shortest-route  algorithm  (e.g.  ,  FI.OY(>2L 
compute  the  shortest  route  flow  under  the  metric  1  . 

where 

Vi =  (nT 

as  defined  previously. 

Stopping  Rule 


i-1 


Of 


IF  fln  <r  e  .  for  some  previously  defined  e  ,  stop. 
El.SK,  go  to  step  5. 

Let  b  =  NA  +  2 


(NA*  2),  n  n 

</> 


0 


n  -  n  »  1 


Go  to  step  1. 
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A  starting  feasible  flow  is  obtained  by  relaxing  capacity  constraints 
and  introducing  penalty  functions  outside  the  feasible  region.  For  an  exact 
formulation,  the  reader  should  consult  ('ANT74. 

Bachlx) nc  Design  Model.  The  models  developed  by  NAD  are  based  on 
a  paper  [HSlK7(i]  entitled  "Locating  Backbone  Switches  in  a  Large  Backet 
Network.  "  Our  discussion  of  NAT's  model(s)  is  based  upon  this  paper. 

At  the  outset,  it  is  important  to  note  that  for  a  large  network,*  the 
following  problems  pose  special  and  important  challenges. 

•  Selection  of  the  number  and  location  of  the  backbone  nodes. 

•  Assignment  of  terminals  and  hosts  to  backbone  switches 

(Partitioning  Problem). 

The  design  of  local  access  networks  may  bo  approached  in  the  same 
way  as  the  design  of  centralized  networks,  for  which  there  exist  good  algo¬ 
rithms.  A  number  of  these  algorithms  have  already  been  discussed  in  this 
paper;  the  reader  is  referred  to  the  appropriate  discussions  for  further 
references. 

Similar  remarks  may  l>o  made  concerning  the  design  of  the  communi¬ 
cations  subnet. 

A  statement  of  the  problem  (from  11SLE7(>)  is  as  follows. 

Given : 

•  User  sites. 

•  Point-to-point  traffic  requirements. 


♦ 

For  discussions  on  the  unique  design  problems  involved  with  large  networks, 
the  reader  would  do  well  to  consult  GERL7.r>  and  FRAN73. 
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•  Backbone  switch  cost  and  characteristics. 

•  Set  of  switch  candidate  sites. 

•  Line  tariff  alternatives. 

Minimi/e ; 

Total  communications  cost  H;  where  1)  trunk  cost  •  switch 
cost  •  local  access  cost. 

t  Her  the  Yn  riablcs  : 

•  Switch  number  and  location. 

•  Backbone  topology. 

Such  that : 

•  Backbone  traffic  requirements  arc  met. 

•  Backbone  delay  constraints  arc  met. 

•  Baeklxtne  reliability  constraints  are  mot. 

•  Switch  capacity  constraints  are  met. 

I  he  following  assumptions  and  approximations  are  used  to 
simplify  the  design  procedure: 

•  Switch  cost  is  assumed  linear  with  capacity. 

•  Bickbono  trunk  cost  is  calculated  using  a  direct  distance  model. 

Those  models  will  lie  presented  which  are  at  the  core  of  the  backbone 
ign  approach. 

1.  Backlxme  Network  Cost  estimation  —  This  model,  which  provides 
an  estimate  of  the  cost  t  particular  design,  is  employed  in 
conjunction  with  the  following  two  algorithms  in  order  to  evaluate 
the  cost  ot  different  configurations  without  suffering  an  enormous 
"run-time"  penalty. 
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2.  An  Add  Algorithm  —  Patterned  after  the  add  algorithm  discussed 
in  section  2,  this  algorithm  provides  a  solution  to  the  backbone 
location  problem. 

3.  A  Cluster  Algorithm  —  Patterned  after  the  COM  algorithm 
discussed  in  this  section,  this  algorithm  also  provides 

a  solution  to  the  location  problem. 

It  is  important  to  note  that  the  fundamental  issue  in  selecting  a  location 
for  a  backbone  switch  is  the  trade-off  between  the  local  access  costs  of 
homing  a  particular  user  node  to  a  switch  which  is  already  in  place,  and  the 
costs  of  adding  another  switch  to  the  backbone. 

1.  Backbone  Network  Cost  Estimation.  The  first  construct  to  be 
identified  is  the  routing  penalty,  P.  This  is  defined  as  the  ratio  of  the 
shortest  path  distance  in  the  network  and  the  direct  (actual)  distance  for  the 
average  source  destination  (i , j )  pair.  P  is  experimentally  determined. 

The  mileage  on  the  route  between  i  and  j,  !r!  ,  may  be  approximated 

bv  R!  =  P  x  d  ,  where  d.  is  the  actual  mileage  between  i  and  j. 
ij  ij 

Given  these  two  expressions,  the  model  for  the  backbone  line  cost 
becomes : 


D  = 


_  cr..  d..  (1  +  b)p 
— 


NN  •  F 
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where  : 

p  average  link  utilization  p.  -  h 

F  fixed  cost  node 

e  eost  mile  x  unit  bandwidth  x  month 
.  traffic  requirement  from  i  to  j 

NN  numlter  of  switches 

b  line  overhead  (protocols) 

d..  actual  mileage  Ix'twoen  i  and  i 

!  he  t i i  st  expression  after  the  double  summation  is  an  estimate  of  the 
line  costs  between  i  and  j,  while  the  second  expression  represents  the 
switch  costs. 

i'he  coefficients  P,  p,  and  F  are  experimentally  determined  for 
different  values  ot  NN. 

The  Add  Algorithm.  Wo  assume  that  the  backbone  locations 
...  ,  o  ,  have  already  been  chosen,  and  that  not  all  user  locations  have 
been  assigned  to  a  switch,  t  et  d  represent  the  set  of  unassigned  users.  We 
then  examine  the  trade-off  of  assigning  each  one  of  the  users  to  either: 

•  A  switch  located  at  the  traffic-weighted  center  of  mass  of 
mass  ,  or 

•  A  switch  that  lias  not  as  yet  been  selected  from  the  list  of 
candidate  Imckbonc  switches. 


1  It'. 


If  T  is  an  arbitrary  user  from  d,  the  saving  S(Q)  in  homing  T  to  Q 


is  given  in  : 


SlQ)  =  LASt(Q)  »  RLSt(Q) 


where  the  tirst  expression  refers  to  the  local  access  saving,  while  the  second 
refers  to  the  backbone  line  saving. 

The  local-access  saving  is  given  by 


I.AS.j.iQ'i  =  k.j,  •  d 


(  r'co) " d 


where  k  cost  mile  for  the  local  access  line. 

The  backbone  line  saving  is  approximated  by 


where : 


k 

Buvf'-vcf--Ev  [d (co-°t) - d • 


t  total  traffic  at  site  T 

t.  -  total  traffic  of  users  assigned  to  switch  0 
i  i 

S' OTA  l  =  to,a^  network  traffic 


c_  =  c  •  — ,  where  these  symbols  have  been  defined  above 

H  p 

e ^  is  essentially  the  "cost  average  route." 
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In  true  add  algorithm  fashion,  we  assign  the  user  sites  to  Q  in  the 
order  of  their  greatest  saving  until  Q's  capacity  is  used  up. 

The  total  savings  from  adding  Q  is  then  given  by 

StQ>  -  S  (Q)  -  C  where  C  is  the  switch  cost. 

T  assigned  to  Q 

I  lie  candidate  with  the  largest  saving  S(Q)  is  then  selected  as  the  next 
switch  site.  In  the  event  that  no  saving  is  possible  among  anv  of  the  candi¬ 
dates,  the  switch  closest  to  the  center  ot  mass  is  chosen  as  the  next  site. 

>1-  1'he  Cluster  Algorithm.  This  algorithm  operates  in  much  the  same 
way  as  the  already-discussed  COM  algorithm  for  TACOM  location.  The 
assumptions  upon  which  its  use  is  based  are: 

a)  Backbone  switch  locations  which  minimize  local  access  cost 
tend  to  minimize  the  total  network  cost. 

Hsieh,  et  al.  feel,  as  a  result  of  their  practical  experience,  that 
the  dominant  factor  in  total  network  cost  is  the  local  access  cost. 

bl  Natural  geographic  groupings  of  nodes  are  a  good  basis  for 
partitioning  the  network. 

Hsieh,  et  al.  have  also  performed  a  comparison  of  the  two  algorithms, 
and  note  that  the  add  algorithm  performs  better  than  the  cluster  algorithm, 
although  it  is  more  costly  to  use.  They  therefore  recommend  its  use  in  the 
event  that  there  are  relatively  few  switch  sites  in  comparison  to  the  number 
of  user  sites.  Given  the  opposite  circumstances,  use  of  the  cluster  algo¬ 
rithm  is  recommended. 

Sum  ma  ry 

NAC  is,  in  the  opinion  of  the  author,  the  preeminent  organization  in 
the  area  of  network  analysis  and  design  in  the  country.  This  is  a  natural 
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consequence  of  the  company’s  history.  Two  of  the  principals,  Howard  Frank 
and  Ivan  Frisch,  were  largely  responsible  for  the  topological  design  of  the 
ARPANET.  While  they  were  professors  at  Berkeley  (prior  to  founding  the 
company),  they  were  active  researchers  in  network  analysis.  This  work 
culminated  in  their  writing  the  standard  text  in  the  field,  Com munication, 
Transportation,  and  Transportation  Networks  [  FRAN71a) .  Dr.  Frisch  is 
presently  editor-in-chief  of  the  journal.  Networks. 

Two  other  well-known  figures  on  their  staff  are  Drs.  Wushow  Chou 
and  Richard  Van  Slvke. 

The  company  has  made  fundamental  contributions  to  the  state  of  the 
art  in: 

•  Distributed  Network  Design  —  They  developed  the  branch  Xchange 
and  cut-saturation  algorithms. 

•  Network  Partitioning  and  Concentrator  Placement  Algorithms  — 
They  developed  the  COM  algorithm. 

•  Line- Layout  Problem  —  The  unified  heuristic  algorithm  is  theirs. 

•  Network  Reliability  —  The  functional  simulation  and 
Moore-Shannon  simulations  are  theirs. 

•  Design  of  Large  Distributed  Networks  —  The  backbone  design 
approach  presented  in  this  paper  appears  to  be  without  rival. 

•  Work  on  packet-radio  and  satellite  communications  has  also 
been  conducted. 

In  addition,  NAC  also  maintains  an  active,  on-going  research  program 


under  the  aegis  of  an  ARPA  contract. 


SUMMARY  AND  CONCLUSIONS 


As  indicated  in  the  introductory  section,  the  objective  of  this  report 
has  been  to  assess  the  efficacy  of  software  packages  available  for  the 
design  of  computer-communications  networks.  Our  approach  to  the  task 
has  been  to  perform  a  comparativ  e  analysis  of  the  algorithms  employed  by 
the  packages  in  the  various  network  design  areas.  As  no  opportunity 
presented  itself  for  the  actual  application  of  any  of  these  packages  to  a 
suitable  Air  Force  problem,  we  have  had  to  l>e  content  with  this  approach. 

Several  major  observations  may  be  made  as  a  result  of  our  compari¬ 
sons.  ( 1  he  reader  may  wish  to  consult  the  summary  charts  appearing  in 
tables  2  and  ;t.  ) 

With  two  notable  exceptions  —  the  VANS  model  and  the  models 
developed  by  Network  Analysis  Corporation  —  the  design  tools  are 
directed  toward  the  design  of  centralized  computer  networks.  This  is  a 
perfectly  understandable  tendency,  since  for  most  of  the  commercial 
market-place,  a  centralized  design  is  what  is  really  required.  The  mili¬ 
tary  situation  is  quite  different  and  will  be  further  addressed  below. 

Another  observation  that  may  readily  lie  made  is  that  those  organiza¬ 
tions  that  only  design  centralized  networks  have  patterned  their  products 
after  IBM's  Communications  Network  Design  Program  (CNDP).  CNDP 
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Specifics  of  Implementation 


is  available  only  in  the  form  of  an  IHM  marketing  tool,  and  as  such  is  not 
available  for  study.  On  the  other  hand,  its  algorithms  have  been  exten¬ 
sively  described  by  .lames  Martin  [MART72]  of  the  HIM  Systems  Research 
Institute,  Consequently,  the  line-layout  problem  is  approached  via  the 
Ksau- Williams  algorithm  (developed  for  CNP1’)  and  the  concentrator  place¬ 
ment  problem  is  solved  employing  an  add  algorithm,  as  is  also  done  in 
(.'NOP. 

\  serious  shortcoming  in  all  ol  these  packages,  from  the  military 
point  ot  view,  is  tho  lack  ol  network  reliability  (survivability)  programs. 

With  the  exception  ol  Scientific  1'imc  Sharing  Corporation’s  model, 
all  of  the  "central  network"  companies  have  written  their  programs  in 
f  OR  I  RAN.  Most  ot  them  are  available  in  a  time-sharing  mode.  The  STS 
model  is  written  in  A I  I  ,.  1  able  •>  is  a  summary  of  the  specific  implemen¬ 

tation  details  of  each  of  the  packages. 

In  noting  some  ot  the  special  requirements  that  must  bo  made  of  a 
computer-communications  network  in  the  introduction  to  this  paper,  wo 
listed  lour  characteristics  good  response  time  throughput  characteristics, 
survivability,  expandability,  and  cost-effectiveness.  Although  the  list 
could  easily  be  extended,  characteristics  such  as  these  indicate  the 
necessity  for  a  sophisticated  technology  —  the  ability  to  design  survivable, 
distributed,  packet-switching  networks,  for  example.  A  corollary  to  this 
is  the  absolute  necessity  ol  possessing  the  best  available  design  tools, 
which  can  make  ollootivo  use  of  all  the  options  a  rapidly  changing  technology 
makes  available.  A  second  corollary  is  the  support  of  a  first-rate  consult¬ 
ing  stall  to  go  along  with  these  tools,  as  all  problems  are  really  unique 
and  pose  their  own  special  challenges. 


It  is  the  opinion  of  the  author  that  Network  Analysis  Corporation 
is  the  preeminent  organization  in  the  area  of  designing  computer- 
communications  networks.  Indeed,  the  company  may  he  said  to  have 
created  the  state  of  the  art  in  many  of  the  import  design  areas.  NAC  is 
alone  in  offering  a  tool  for  the  design  of  a  truly  distributed,  packet- 
switched  network,  alone  in  offering  reliability  models,  and  alone  in 
ottering  tools  for  the  backbone  design  of  a  large  network.  In  addition, 
their  staft  is  composed  of  acknowledged  experts  in  the  area. 

The  VANS  model,  created  b\  Professor  M.  Schneider  of  the 
University  of  Minnesota,  appears  to  have  great  potential  for  answering 
questions  in  the  area  of  protocol  design. 

both  of  these  models  would  make  excellent  choices  for  application 
in  any  program  being  conducted  by  KSH  in  the  area  of  network  design. 
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